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NOVEL HEDGEHOG-DEWVED POLYPEPTTOES 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

This invention relates generally to the field of protein processing and protein signalling 
pathways and specifically to two novel proteins having distinct activities, vAdch are 
derived fi^om a common hedgehog protein precursor. 

2. Description of the Related Art 

Embryologists have long performed ejq)erimental manipulations that reveal the striking 
abilities of certain structures in vertebrate embryos to impose pattern upon surrounding 
tissues. Speculation on the mechanisms underlying these patterrang e£&cts usually 
centers on the secretion of signaling molecule that elicits an appropriate response from 
the tissues begin patterned. More recent work aimed at the identification of such 
signaling molecules implicates secreted proteins encoded by individual rosmbers of a 
small number of gene femilies. One such fiunily of proteins which may have an 
influential, eflfect upon patterning activities are those proteins encoded by the hedgehog 
gene fiunily. * : 

The hedgehog (hh) gene was initially identified based on its requirement for normal 
segmental patterning in Drosophila (Nusslein-Volhard, C. & Wieschaus, E, Nantre 
287:795-801, 1980). Its fimctions include local signaling to coordinate the idMtities of 
adjacent cells within early embryonic stents (Hooper, J.E., & Scott, M P. Early 
Embryonic Development of Animals, pp. 1-48, 1992) and a later fijnction in cuticle 
patterning that extends across many cell diameters (Heerhskerk, J. & DiNardo, S., Cell, 
26:449-460, 1994). The hh gene also fimctions in the patterning of imaginal preciu-sors 
of adult structures, including the sq>pendages and the eye (Mohler, J. Genetics, 120: 1061- 
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1072, 1988; Ma, eiaL. Cell, 25:927-938. 1993; Heberldn. et al., Cell, 75:913-926, 1993; 
Tabata, T. & Kornbcrg, T.D., Cell, 26:89-102, 1992; Easier, K. & Struhl, G,, Nature, 
3^:208-214, 1994). Genetic and molecular evidence indicates that hec^fK>g protems 
are secreted and function in extraceQular signaling (Mohler, J., supra, Lee, et al, Cell, 
s 21:33-50, 1992; Taylor, et al, Meek Dev., 42:89-96, 1993). 

In vertebrates activiti^ encoded by hh homologues have been implicated in ante- 
rior/posterior patterning of the limb (Riddle, et aL, Cell, 25:1401-1416, 1993; Chang, et 
al. Development, 120:3339, 1994), and in dorsal/ventral patterning of the noiral tube 
(Echelard, etal. Cell, 25:1417-1430, 1993; Krauss, etal. Cell, 25:1431-1444, 1993; 
10 Roelink, et al, Cell, 76:761-775, 1994), 

The vertebrate ventral midbrain contains n^ons whose degeneration or abnormal 
fimcdon are linked to a number of diseases, inchiding Paiidnson*s disease and schizophre- 
nia. It is known that motor neurons develop in close proximity to the floor plate in the 
ventral midbrain, \4idbrain projections to the striatum are involved in the control of 

15 voluntary movement (Bjoiklund and lindvall, In: Handbook of Chemical Neuroanatomy, 
eds., Borklund, et al., Amsterdam: Elsevier, pp5S-122, 1984) and loss of these neurons 
results in the motor disorders of Parkinson's disease (Hirsch, et al. Nature, 234:345, 
1 988). Midbrain dopaminergic neurons that innervate lirabic structures and the cortex 
influence emotional and cognitive bdiavior, respectively, and abnormal function of these 

20 n^rons has been associated with schizophrenia and drug addiction (Seeman, et al,, 
Nature, 365:441,1993), 

While the molecular nature of the fectors that specify jieuronal cell &te have not been 
established, members of the transforming growth factor-p (TGF-P) (Lyons, et al, , Trends 
in Genetics, 2:40S, 1991) or the hedgdiog protein fcmily (Smith, ^J^^^ 1994) 
25 may possess the characteristics expected fi-om such &ctors as they parfictpate in 
specification of cell fitte, mediate inductive interactions between tissues, and in many 
cases act at a distance of only a few cell diameters. 
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Tte present invention establishes that hh actr>dties encoded by these genes play a oucial 
role in early patterning of the developing eye and in patterning of the brain. For the firtf 
time, the invention shows that internal cleavage of hedgehog protein product is critical 
for fiiU fimction, and that the two novel products of this auto-proteolytic cleavage display 
5 distinguishid>le activities, thus demonstrating that hh ^gnaling activity is a composite 
efifect of two separate agnaling proteins that derive fix>m a common hh protein precursor. 
In so doi^g, the invention provides the means for spedfic patterning and proliferation of 
desired neuronal cell types for addressing disorders which arise from neuronal 
degeneration or abnormal fimction. 
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SUMMARY OF THE INVENTION 

The present invention is based on the seminal discx>very that hedgehog protons undergo 
auto-proteolyttc cleavage vMdti results in two sq>arate proteins haying distinct functional 
and structural characteristics. The two polypeptides, referred to as the *T^" and "C" 

5 fragments of hedgehog, or N-terminal and C-terminal fragments, respeaively, are 
produced after specific deavage at a G*CF ^te recognized by the outoproteolytic domain 
in the native protein. 

■ ■ ^ 

Thus, in one embodiment, the invention provides a substantially pure polypeptide 
characterized by having an amino acid sequence derived from amino terminal amino 
10 acids of a hedgehog protein and having at its caiboxy terminus, a Gl CP cleavage site 
specifically recognized by a proteolytic activity of the carboxy terminal fragment of the 
native hedgehog polypeptide. 

In another embodiment, the invention provides a substantially pure polypeptide 
characterized by having an amino add sequence derived from carboxy traxiinal amino 
15 acids of a hedgehog protein and having at its amino terminus, a GI CF cleavage site 
specifically recognized by a proteolytic activity of the carboxy ternunal fragment of the 
native ijedgehog polypeptide. 

The invention also provides a method for modulating proliferation or differentiation of 
neuronal cells, comprisng contacting the c^Us with a hedgehog polypeptide The native 
20 hedgehog polypeptide, the N, or the C fragment, or fimctional fragments derived 
therefrom, are most usefid for the induction of proliferation or diff^entiatipn of noironal 
cells substantially derived fix>m floor plate n^onal cells. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows processing of the hh protein by immunoblots (A,C) with antibodies 
against amino (Abl) and caiboxy-terminal (Ab2) epitope. FIGURE IB and D are blots 
of samples immunopredpitated v^th Abl (B, lanes 7-9), Ab2 (D, lanes 19-21), or pre- 
s immune5erum(B, lanes 10-12, and D» lanes 22-24). 

FIGURE IE shows a schematic illustration of the hedgehog cleavage mechanism. 

FIGURE 2 shows sequence dmilarity between hh proteins and serine proteases, hh 
protdn sequences are aligned to readues 323 to 329 of the D. meianogaster protein and 
numbered as portions 1 to 7 (group A). The catalytic histidines of mammalian serine 
10 protdnases (group B) are aligned to the invariant histidine at position 7 in hh proteins. 

FIGURE 3 shows autoproteolysis of the hh protdn. 3 A shows a coonuisie blue stained 
polyacrylamide gel showing production and purification of His^U and 1^-^29a 
protdns fixim £. colL Samples were molecular weight markers (lanes 1 and 2); lysates of 
£1 coli cells carrying the Ifis6-U expression construct without (lane 3) and with (lane 4) 

15 induction by IPTG, purified HisrU protein (lane 5); lysates of K coli cells that cany the 
HiS6*U|Q29A e)q>ression construct without (lane 6) and with (lane 7) induction by IPTG; 
purified IfivUH329A protein (lane 8). FIGURE 3(B) is an immuiioblot detected with Ab2 
showing transfected S2 cells induced to express hh (lane 1); His«-U and Hi; -Us29a 
proteins incubated in cleavage reaction buff^ for 0 hours (Isuies 2 and S), for 20 hours 

20 Oanes 3 and 6), and hours m the presence of 20 mM TAME (a serine protease 
inhibitor) (lanes 4 and 7)r^ ' 

FIGURE 4 shows autoproteolytic fimctions of Drosophila (4 A-C) and zebr^sh (D) hh 
proteins map t the carboxy terminal fi:agments by in vitro translations f wild-type and 
mutant hh proteins. The locations of mutations and cleavage sites (arrows) in these 
25 proteins are illustrated schematically in 4E. 
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FIGURE 5 shows immunoblots showing heat shock induced expression of wild type and 
H329A mutant hh protdns in Drosophilia embryos (A) and (B) are inmiunoblots 
developed using Abl and Ab2 antibodies, respectively. Lanes 1 and 6, induced 
untransfected S2 cells; lanes 2 and 7, transfected S2 cells induced to express hh, lanes 3 
5 and 8, heat shocked wild-type embryos; lanes 4 and 9, heat shocked hshh embryos; lanes 
5 and 10, heat shocked hshh H329A embryos. 

FIGURE 6 shows in sihi hybridization showing the embryonic efFects of ubiquitously 
expressed wild type and H329A hh proteins. FIGUEtE 6 shows the embryonic 
distribution of wingless (wg) RNA as revealed by in situ hybridization is shown in (A) 

10 wild-type (homozygous y h^""), (B) hshh, and (C) hshh H329A embryos that were 
exposed to two 10 minute heat shocks separated by a 90-minutc recovery period (33). 
Wld-type anhryos showed little change in wg expression, whereas the wild-type protein 
and, to a lesser extent, the H329A protein each induced ectopic wg expression (Table 1). 
Panels (D), (E). and (F) show the doreal surfices of/ w'''\ hshh, and hshh H329A larvae, 

15 re^ectivdy, at the levd of the fourth abdominal segment. These larvae were shocked for 
30 minutes as embryos and allowed to complete embryogenesis. Cuticle cell types (1°, 
2% 3 % and O are labeled as described (J. Heemskerk and S. 
1994). Note the expansion of 2'' cell types (naked cuticle) at the expense of 3"" and some 
4 types in the hs/isfr embryo (E) under conditions where the phenotype of hs/iA H329A 

20 embryos (F) is identical to tHkt of control embryos (D). 

FIGURE 7 shows X-gal staining to show imaginal disc effects of ubiquitous wild type 
and H329 hh proteins. X-gal staining was used to follow expression of wg (A-C) or 
(D-L) in imaginal discs of late third-instar larvae that carry Mg-lacZ ox c^p-IacZ reporter 
genes. Leg (A-F), wing (GJ) and eye-antennal discs (J-L) from controMarvae (A, D, G, 
» J), larvae carrying the hshh transgene (B, E, H, K) and larvae carrying the hsWf H329A 
transgene (C, F, I, L) are displayed. In all panels anterior is to the left. 
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FIGURE 8 (A) and (B) are inununoblots of cell pellets (lane 1 ) or supematants (lane 2) 
from transfected S2 cell cultures expressing HH protein, developed with Abl (A) and 
Ab2 (B). Samples in each lane were from the same volume of resu^ended total culture. 
Whereas N ranained mostly associated with the ceU pellet (compare lanes I and 2 in A), 

5 C was nearly quantitatively released into the supernatant (compare lanes 1 ahd 2 in B). 
U displayed partitionmg properties in between those of N and C (A and B). (C) 
demonstrates the heparin binding activity of various HH protein species generated by in 
vitro translations with microsomes (38). Samples were: total translation mbc (lane 1); 
supernatant after incubation with heparin agarose or agarose (control) beads (lanes 2 and 

10 4); and matoial eluted from heparin agarose or agarose beads after washing (lanes 3 and 
5). F, U, Nffi and N fragments arc depleted from reactions incubated with heparin agarose 
but not agarose beads (conq>are lanes 2 and 4 to 1), and the same species subsequently 
can be eluted from the heparin agarose but not the agarose beads (compare lanes 3 and 
5 with lane 1). 

15 FIGURE 9 shows the deferential localizations of N and C in embiyos by in situ 
localization of the hh transcript. Fig. 9 (A) is shovm in comparison to the distribution of 
N and C epitopes detected with Abl and Ab2 in panels (B) and (C), respectively. Note 
that the disMbution of N and C q)itopes span approximately one-third and one-half of 
each segmental unit respectively, vdiile the transcript is limited to approximaiely 

20 one-quarter of each unit. In CD), the localization of C epitopes m embiyos homozygous 
for the hh^^^ allele is detected with the use of Ab2. C epitopes in this mutant, which 
displays inqiaired auto-proteplytic aaivity (see tCTCt), are more restricted, and resemble 
the wild-type localization of N. Homo^gous A/r**^ embryos were identified by loss of a 
marked balancer from a heterozygous parent stock. AU embryos are at mid to late stage 

25 9 (extended germ-band). 



FIGURE 10 ^ows a signal relay ver^s dual fimction models for hh protein action. In 
Fig. 10 (AX the long-range ^fects of hh signaling are achieved indirectly through short- 
ran^ induction of a second signaling molecule (X). Based on its biochemical properties 
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and its restricted tissue localization, N is presumed to represent the active short-range 
signal while the role of C would be limited to supplying the catalytic machinery required 
for biogenesis of N. In (B), the long- and short-range signaling functions of hh are 
supplied by the N and C proteins derived by internal auto-proteolysis of the U precursor. 
5 N is implicated in short-raitge signalmg by retention near its cellular site of synthesis, 
while C is less restricted in its distribution and would execute lorig-range signaling 
fiuictions. In both models, auto-proteolysis is required to generate fiilly active signaling 
proteins. 

FIGURES 10 C and D show an immunoblot of the N fragment synthesized from a wild 
10 type construct (C) or a consruct lacking the C domain (D). 

FIGURES 11 A and B show tte nucleotide and deduced amino acid sequences for partial 
human hh clones. 

FIGURE 12 A and B show in vitro cleavage reactions of a Drosophila hh protdn 
produced in EL coli and purified to homogeneity. FIGURE 12, Panel A shows a time 

15 course of cleavage after initiation by addition of DTT. Panel B shows incubations qf 
concentrations tangii^ over three ordo" of magnitude for a fixed time period (four hours), 
with no difference in the extent of conversaon to the cleaved form. Panel C shows the 
sequence around the cleavage site as determined by amino-tcrminal sequence of the 
cleaved fragment C. The cleavage site is denoted by the arrow, and the actual residues 

20 sequenced by Edman degradation of the C fragment are underlined. Panel C also shows 
an alignment of all published vertebrate hh sequences plus some of unpublished 
sequences from fish and Xenopus. The sequences shown correspond to the region of 
I>osqphila hh wha% the cleavage occurs, and demonstrates the absolute conservation of 
the Gly-Cys-Phe sequence at the site of cleavi^e. Panel D shows a SDS-P AGE gel 

25 loaded with in vitro transoiption/translation reactions as described in the previous 
Examples, using various hh genes as ten^ilates. dhh is Drosophila, twhh and zMh are 
the twiggy-winkle and sonic hh genes of the zdirafish, and mshh is the sMt/Hgh-l/vkh-l 
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gene of the mouse. Panel E shows that Edman degradation of the C fi:agments releases 
^^S counts on the fist but not subsequent rounds for all these proteins, indicating that the 
site of autoproteolytic cleavage for all of these hh proteins is the amide bond to the 
amino-terminal ^de of the Cys residue that forms the center of the conserved dy-Cys- 
5 Phe sequence highlighted in pand C. 

FIGURE 13 shows the predicted amino add sequences are shown in single letter code. 
13(a) ^ows sequences common to five distinct A^like genes are shown with a dot 
indicating identity with the correq)onding residue of zebrafish twiggy-mnkle, 13(b) 
^ows amino add sequences of twhh and shh are aligned to those of the sonichkh-I class 
10 from chick and mouse. The amino-temunal hydrophobic stretch common to all four hh 
genes is shaded. The astoisk (*) denotes invariant amino acid residues associated with 
the proteolytic domain of C fi^agment from various spedes. 1 3(c) shows percent identity 
of residues carboxy-terminal to the Iqrdrophobic r^on. 

FIGURE 14 shows a comparative CTcpression of twhh, shh, and pcoc-l during zebrafish 
15 embryogeneds. 

FIGURE IS shows the effects of ectdpic on zebrafish development. Wild 
zebrafish, ZXsmo rer/p, Ekkwill Waterlife Resources) were maintained at 28.S*C, sonie 
embryos were thra cultured overnight at RT. Zebrafish embryos were injected at the 1-8 
cell stage with twhh, shh, or /dcZRNA and examined at 28 h of devdopment. (a-c) 
2D Dorsal view of the midbrain-hindbrain region; amerior is left, (a) lacZ. (b) twhh (c) shh. 
(d-0 Frontal optu:aI section of the forebrwt region; anterior is up. (d) lacZ, (h) twhh. (f) 
shh (g-1) Lateral view of the eye region; anterior is left, (g) hcZ (h) twhh. (i) twhh, 

FIGURE 16. is a table showing the effects of ectopic expression of shh, twhh and twhh 
mutants on zebrafish embryonic development. 
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FIGURE 17 shows zebrafish twigg)hwinkle hedgehog derivatives. 17(a) Cartoons of 
various twhh open reading frames. SS (shaded) is the predicted N-tmninal signal 
sequence for secretion of these protdns and encompasses the first 27 stnino acids of each 
open reading fi^me. The arrow indicates the predicted internal site of auto-jproteolytic 
5 cleavage. Amino acid residue numbers are according to Figure 13b The filled triangle 
denotes the normal tomination codon for the /if/eA open readiiig frame. Construct Uha 
contains a mutation that blocks auto-proteolysis (the histidine at residue 273 is changed 
to an alanine; see Lee, J. J., et al, supa ). Construct U3S6|{a contains a stop codon in 
place of amino add residue 357 as well as the H273A mutation in Uha Construct N 
10 encodes just the first 200 amino adds of twhh. Construct C has had the codons for 
residues 31-197 deleted. 17(b) shows in vitro translation of the expression constructs 
diown schenmtically in part a. Constructs were translated in vitro in the presence of ^'S 
methionine and analyzed by autoradiography after SDS^PAGE. 

FIGURE 18 shows Northern blot analysis of the effect of hedgehog on expression of 
15 various neuralniarkers. 

FIGURE 19 shows hh synergy with naturally occurring neural maricers or agents {e.g.^ 
XAG-1, XANF-2, Otx-A, En-2. Krox-20. Xlh box^, NCAM, and EF-lct) 

FIGURE 20 shows Northern blot analysis of the eflFect of hedgehog N or C on various 
neural markers. 

20 FIGURE 21 shows AN-C interferes with X-bhh and N-activity in anirnal cap e)q)lants as 
shown by RT-PCR analysis. 
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DETAILED DESCRimON OF THE INVENTION 

The present invention provides two novel polypq>tides originally derived from a single 
precursor protein, both of v/blch have distinct stnictural and fiinctional characteristics. 
The protdns are derived from a hedgehog protein and can be naturally produced by aut - 
5 proteolytic deavage of the fiill-l^igth hedgehog protdn. Based on evidence provided 
herein, which indicates that hedgehog precursor protdn and the auto-proteolytic products 
of hedgehog precursor protein are expressed in the floorplate of the ventral midline of the 
neural tube and notochord, the invention now provides a method for the induction of 
proliferation or differentiation of neuronal cells associated with or in close proximity t 
10 the floorplate and notochord. 

In a first embodiment, the invention provides a substantially pure po^eptide character- 
ized by having an amino add sequence derived from amino terminal amino acids of a 
hedgehog protein and having at its catboTcy terminus, a ^ydne*cysteine-phenylalanine 
(GICF) deavage site specifically recogpzed by a proteolytic activity of the catboxy 

15 terminal fixign^nt of the native hedgehog polypeptide. This fixigment is denoted the N- 
terminal fitigment or polypeptide or "N", h^dn. For example, in the case of the 
I>osophila hedgehog, the N firagment indudes amino acids J-257 of hedgehog protdn, 
wherein amino adds 85-257 have a molecular weight ol* about 19 kD by non-reducing 
SDS-PAGE (Amino acid residue numbers 1-2S7 indude non-stmctural features such as 

20 dgnal sequences.). The G 1 CP cleavage site m Drosophila hedgdiog precursor protdn 
occurs at amino add residues 2S7-2S9: Those of skill in the art will be able to identify 
the GiCF cleavage site in other hedgehog genes, as the amino add location will be 
similar and the site will be specifically recognized by the autoproteolytic activity of the 
corresponding C fragment. 

25 The N-terminal p lypeptide is also characterized by bdng cd]<^$sociated in cells 
expressing the polypeptide m vitro, and being spedfically localized in vertebrat or 
Drosophila cells or embryos, for example. In other words, this N-terminal fragment of 
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hedgehog, remains dose to the site of cdlular synthesis. The association of N wth the 
cdl is a result of the processing event w4iich involves lipophilic modification of the amino 
terminal domain. (See Figure IE) This modification is initbted by thje action of the 
carboxy tmninal domain* generating a thioester intermediate; the caitoxy^cfnninal 
domain tlms does not act simply as a protease, although cleavage of a peptide bond does 
ultimately resuk fi-om its action. In addition, the N firagmcnt binds to heparin agarose m 
vitro. 

The N polypeptide of the invention is characterized by having an amino add sequmce 
derived fiom amino temunal amino adds of hedgehog protein, e.g. ,1-257 in Drosophila, 
virherdn amino adds 1-257 have a molecular wdght of about 19 kD by non-reducmg 
SDS-PAGE, The N polypeptide includes smaller fii^gments which retain the fiinctional 
characteristics of fiill length N, eg., bind to heparin. The hedgehog protein from v^rhich 
N is derived indudes, but is not limited to Drosophila, Xenopus, chicken, zebrafish, 
mouse, and human. Ciystallographic analysis shows the structure of SHH-N indudes 
the presence of a zinc ion. While not ^ranting to be bound by a particular theory, the 
presence of the adnc ion is suggestive of zinc hydrolase activity. Zinc hydrolases indude 
proteases such as carfooxypeptidase A and thermolysin. Upases such as phosphoIfl)ase C, 
and othar enzymes such as cari>onic anlqrdrase. Alterations in the zinc hydrolase site of 
tf*e amino tominal signaling domain may be usefiil fi)r modulating the range of diffusion 
of a hedgehog protein or to after the signaling charaaeristics of the amino tenninal 
signaling domain. For example, a nwtation in the zinc hydrolase site may result in a 
tethered protein where orduiarily the protein is secreted at a distance. The result would 
be induction of a cdl type not typically induced. Alteration in the zinc site may resuh in 
a molecule capable of inducing motor neurons and not floor plate, and vice versa. 

The identification of a cell-surfece, or extracellular matrix locaBzation of N and its 
expression in notochord and floor plate-associated cells, provides a means for isolation 
or spedfic sdection of cdls expresang N, e.g. , to isolate a notochord sample or to isolate 
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floor plate cells. In addttion, antibodies directed to N are useful for histological analysis 
of tissues suspected of expressing N prptdn. 

The invention also provides a substantially pure polypeptide characterized by having an 
amino acid sequence derived from cBiboxy terminal amino acids of a hedgehog protein 
5 and having at its amino tmninus a GICF cleavage mte specifically recognized by a 
proteofydc activity of the caiboxy terminal fragment of the native hedgehog polypeptide. 
TWs fragment is denoted the C-terminal fragment or polyp^tide or herein. For 
example, in Drosophila this "C" polypq)tide daives from the C-terminal domain of 
hedgehog precursor protein beginning at amino add residue 258, wherein the frill length 

10 C-terminal domain has a molecular weight of about 25 kD by non-reducing SDS-PAGE, 
a histidine residue at poation 72, and has protease activity. The GtCF cleavage ate 
spedficalty recognized by the proteolytic activity of the caiboxy terminal fragment of the 
native hedgehog polypeptide is located at amino acid readues 257-259. As described 
above for the N fragment, now that the pr^nt invention has shown the precise cleavage 

15 recognition site for the autoproteolytic domain of hedgehog, those of skill in the art can 
readily discern the cleavage site in other hedgehog proteins thereby allowing the ready 
, * identification of any N or C polypeptide of aity hedgehog precursor protan. 

The polypeptide of the invention is derived from the C-terminus of a h^gehog 
precursor protein, beginning at the autoproteolytic cleavage site identified at the GCF 

20 amino acid sequence, which in Drosophila corresponds to amino adds 257-259. In 
Dn^^la the histidine readue found mvariably at amino add residue 329 of the native 
hedgdiog protein, and at amino add residue 72 of the C polypeptide, is essential for auto- 
proteolytic deavage between amino acids 257 and 258 (G and p) Corresponding C- 
potypeptides of the invention will likewise contain a similarly locatbd iustidme residue 

25 wUdi can be readily idmtified, such as by comparison to the Drosophila C-polypcptide. 
Among various species, the proteolytic domain can be characterized by the amino add 
sequence -XTXXHLXX-. 
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The C polypq>tide of the invention, unlike N, does not significantly bind to heparin 
agarose. C is characterized by being released into the culture supernatant of cells 
expressing C polypeptide in vitro and by being localized difiiisdy in cells and embiyos. 
Because C polypeptide difiKises fi^ly, h would be detectable in various body fluids and 
5 tissues in a subject. Identification of C polypeptide expression near the midline of the 
neural tube, as described herein, provides a usefiil assay for neural tube closure in an 
embryo/fetus, for example. The presence of C polypeptide in amniotic fluid would be 
diagnostic of a disorder in ^ch the neural tube may be malformed. 

Altered levels of C poiypq)tide in cerebrospinal fluid may be indicative of neuro- 
to degenerative cUsorders, for example. Because C polypeptide is released from the cell 
after synthesis and autoproteolysis of native hedgehog precursor polypeptide, tumors 
synthesLdtng and releasing high levels of C polypeptide would be detectable without prior 
kno^edge of the exact location of the tumor. 

C fragment is effective in inducing genes of the pituitary and anterior brain as well. In 
15 particular, induction is increased by the addition of a member of the TGF-p &mily of 
growth factors. For example, human activin in combination with C fi^agment may be^ 
effective in enhandng pituitary cell growth and activity or development. 

C fragment is effective in inducing posterior maiicers of the brain by inhibiting N. Such 
a fiagment is exen[q>lified in Example 18 as AN-C. Therefore in another onbodiment, the 

20 invention includes a polypeptide deleting amino acid residues 28-194 of X-bhh. 
(Autoproteolysis gives a C domain of 198-409 as well as a seven amino acid peptide, 
representing aa 24-27 and 195-197). This polypeptide blocks the activity of X-bhh and 
N in explants and reduces dorsoanterior structures in embryos Also included are 
polynucleotide sequences encoding AN-C. AN-C is useful for increasing expression of 

25 posterior neural markers (e.g., En-2» Krox-20, Xlttbox-6) and decreasing expression of 
anterior neural markers (e.^., XANF-2, XAG-1, Otx-A) when desu'able to do so to 
modulate neural patterning. 
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The term "substantially pure** as used herein refers to hedgehog N or C polypeptide 
which is substantially free of other proteins, lipids, carbohydrates, nudeic acids or other 
materials with \^ch it is naturally associated. One skiOed ki^e art can purify h^^ehog 
N or C polypeptide using standard techniques for protein pur^cation. The substantidly 
s pure polypeptide will yield a angle nuijor band on a non-reducing polyacrjdamide gd. 
The purity of tte hedgdiog N or C polypeptide can also be determined by amino-terminal 
amino add sequence analysis. 

The invention includes a fimctional N or C polypeptide, and fiinctional fragments 
thereof As used herein, the temi **fiincdonal polypeptide" or ''functional fragment" refers 

10 to a polypeptide which possesses a biological function or activity which is identified 
through a defined functional assay and vMch is associated with a particular biologic, 
morphologic, or phenotypic aheration in the cell. Functional fragmmts of the hedgehog 
N or C polypeptide include fragments of N or C polypeptide as long as the activity, e.g., 
proteose activity of C polypq>tide remains. Smaller peptides containing the biological 

15 activity of N or C polypeptide are therefore included in the invention. The biological 
function, for example, can vary from a polypeptide fragment as snudl as an epitope to 
which an antibody molecule can bind to a large polypeptide i^ch is capable of 
particq)ating in the characterise induction or programnung of phenotypic changes within 
a cell. . A ''functional polynudeotide" denotes a polynucleotide which encodes a 

20 functional polypeptide as described herein. 

Minor modifications of the N or C polypeptide primary amino add sequdice nuiy result 
in polypeptides which have substantiaDy equivalent activity as compared to the N or C 
polypeptide described herdn. Such modifications may be ddiberate, as by site-directed 
mutagenesis, or may be spontaneous. All of the polypeptides produced by^ these 
25 modifications are inchided herein as long as the proteolytic activity of C polypeptide, for 
example, is present. Further, deleti n of one or more amino adds can alsfb result in a 
mpdificati n of the structure of the resultant molecule without significantly altering its 
acthdty. This can lead to the devdopment of a smaller active molecule which would have 
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broader utility. For example, it is possible to remove amino or caiboxy terminal amino 
acids which may not be required for N or C polypeptide activity. 

The N or C polypeptide of the invention also includes conservative variations of the 
polypeptide sequence. The term "conservative variation** as used herein denotes the 
5 replacement of an amino acid residue by another biologically similar residue. &canq)les 
of conservative variations include the substitution of one hydrophobic residue such as 
isoleucine, valine, leucine or methionine for another, or the substitutibn of one polar 
residue for another, such as the substitution of arginine for lysine, glutamic for aspaitic 
acids, or gjutamine for asparagine, and the like. The term "conservative variation** also 
10 indudes the use of a substituted amino add in place of an unsubstituted parent amino acid 
provided that antibodies raised to the substituted polypeptide also immunoreact with the 
unsubstituted polypeptide. 

The N fragment of the invention includes both the active form of the polypeptide and the 
N fragment including the undeaved signal sequence. For example, m Droso/^ila v^here 
IS the signal sequence is internal (at about amino acids 60-80), the entire uncleaved N 
' fragment beginning at the initiating methionine is included in the mvention. Those of 
skill in the art can readily ascertain the nature and location of the signal sequence by 
using, for example, the algorithm described in von Heijne, G., NucL Acids Res. 14:4683, 
(1986). 

20 The invention also provides an isolated polynucleotide sequence consisting essentially 
of a polynudeotide sequence encoding a polypeptide having the amino acid sequence of 
N or C polypeptide of the invention. The term "isolated** as used herem includes poly- 
nucleotides substantially free of other nucleic adds, protdns, lipids, carbohydrates or 
other materials with which it is naturally associated. Polynudeotide sequences of the 

25 invention indude DNA, cDNA and RNA sequences \^ch encode N or C polypeptide. 
It is understood that all polynucleotides encoding all or a portion of N or C polypeptide 
are also included herein, as long as they mcode a polypeptide with N or C polypeptide 
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activity. Sudi po^ucleotides include naturally occurring, synthetic, and intentionaUy 
manipulated polynucleotides. For example, N or C polypeptide polynucleotide may be 
subjected to site-directed mutagenesis. The polynucleotide sequence for N. or. C 
polypeptide also includes antisense sequences. The polynucleotides of the invention 
5 inchide sequoices tiiat are d^enmte as a result of the genetic code. There are 20 natural 
amino acids, most of which are specified by more than one codon. Therrfore, all 
degenerate nucleotide sequences are included in the invention as long as the amino acid 
sequence of N or C polypeptide polypeptide encoded by the nucleotide sequence is 
fiinctioiudly unchanged. In addition, the invention also includes a polynucleotide 
10 consisting essentially of a polynucleotide sequence mcoding a polypeptide having an 
amino acid sequence of N or C and having at least one qntope for an antibody 
immunoreactive with N or C polypeptide. 

The polynucleotide encoding N or C polypeptide includes the entire polypeptide or 
firagments thereof as well as nucldc add sequences complementary to that s^uence. A 
15 complemratary sequence may include an antisense nucleotide. When the sequence is 
RNA, the deoTcynucleotides A, G, C, and T are replaced by ribonucleotides A, G, C, and 

U, respectively. Also inchided in the invention are fin^ents of the above-described 

• " . ■ ' ' ' 

nucleic acid sequences that are at least 15 bases in length, which is suflBcient to permit 
the fiagment to selective^ hybridize to DNA that encodes the protein under physiologi- 
20 cal conditions. PrefiET{d>ly, the fiagments hybridize uiRier stringent conditions. 

DNA sequences of the invention can be obtained by several methods. For example, the 
DNA can be isolated using hybridization techniques vAich are well known in the art. 
These indude, but are not limited to: 1) iQfbridization of genomic or cDNA libraries with 
grbbes to detect homologous nucleotide sequences; 2) antibody screening of expr^sion 
25 libraries to detect cloned DNA fragments with shared structural features; and 3) PGR 
amplification of a desired nucleotide sequence using oligonucleotide primers. 
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Preferably the hedgehog, N, or C polynucleotide of the invention is derived from a 
vertebr^ organism, and most prefoably from human. Soreening procedures which rely 
on nucleic acid hybridization make it possible to isolate any gene sequence from any 
organism, provided the appropriate probe is available. Oligonucleotide probes^ which 

5 correspond to a part of the sequence encoding the protdn in question, can be synthesized 
chenncalty: This requires that short, oligopeptide stretches of amino acid sequence must 
be known. The DNA sequence ^coding the protein can be deduced from the genetic 
code, however, the degeneracy of the code must be taken into accomit. It is posable to 
perform a nuxed addition reaction ^en the sequence is degenerate. This includes a 

10 heterogeneous mixture of denatured double-stranded DNA. For such screening 
hybridization is preferably performed on either single-stranded DNA or denatured 
double-stranded DNA. Hybridization is partiodarl/ usefiil in the detection of cDNA 
clones derived from sources where an extremely low amount of mRNA sequences 
relating to the polypeptide of interest are present. In other words, by using stringent 

15 hybridization conditions directed to avoid non-specific binding, it is possible, for 
example, to allow the autoradiographic visualization of a specific cDNA clone by the 
hybridization of the target DNA to that angle probe in the mixture which is its complete 
complmert (Wallace, a/,, Mfc^^ 

The development of specific DNA sequences oicoding hedgehog can also be obtained 
20 by: 1) isolation of double-stranded DNA sequences firom the genomic DNA; 2) diemical 
manufecture of a DNA sequence to provide the necessary codons for the polypeptide of 
interest; and 3) in vitro synthesis of a double-stranded DNA sequence by reverse 
transcription ofmRNA isolated from a eukaryotic donor cell. In the latter case, a double- 
stranded DNA complement ofmRNA is eventually formed which is gencndly rcferre^^ 
25 to as cDN A. 'j^'-'^V 

Of the three above-noted methods for developing specific DNA sequences for use m 
recombinant procedures, the isolation of genomic DNA isolates is the least conuhojj: 
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This is especially true when it is desirable to obtain the microbial expression of 
mammalian polypeptides due to the presence of introns. 

The synthesis of DNA sequences is frequently the method of choice when the entii-e 
sequence of amino acid residues of the desired polypeptide product is known. When the 

5 mdre sequence of amino add readues of the desired polypeptide is not known, the direct 
synthesis of DNA sequences is not posdble and the method of choice is the synthesis of 
cDNA sequences. Among the standard procedures for isolating cDNA sequences of 
interest is the formation of plasmid- or phageK:arrying cDNA Ubraries which are derived 
from reverse transcription of mRNA which is abundant in donor cells that have a high 

10 level of genetic expresaon. When used in combination with pol)m3erase chain reaction 
technology, evffl rare expresaon products can be cloned. In those cas^ where significant 
portions of the amino acid sequence of the polypeptide are known, the production of 
U6eled angle or double-stranded DNA or RNA probe sequences duplicating a sequence 
putatively pr^ent in the target cDNA may be miployed in DNA/DNA hybridization 

IS procedures which are carried out on doned copi^ of the cDNA wdiich have been 
denatured into a angle-stranded form (Jay, et aL, NucL Acid Res., 11:2325, 1983). 

; A preferred method for obtaining genomic DNA, for examplie, is Polymerase Ghaih 
Reaction (PCR), which relies on an in vitro method of nuddc acid synthesis by which 
a partkrular segmrat of DNA is spedfically replicated Two oligonucleotide primers that 

20 flank the DNA fragment to be amplified are utilized in r^>eated cycles of heat 
daiaturation of the DNA annealing of the primers to their complementary sequences, and 
extension of the annealed primers widi DNA polymerase. These primers hybridize to 
opposite strands of the target sequence and are orifflted so that DNA synthesis by the 
polymerase pixK:eedsa(TOSS the region b^weoi the pi^ Since the extension products 

25 themselv^ are also complementary to and enable of bindiqg primers^ successive cycles 
of amplificati n essentially ^ouble the amount of the target DNA syntheazed in the 
previous Qrde. The result is an e)q)onential accumulati n f the spediic target fragment, 
approximately 2", where n is the number of cydes of ampliiScatton performed (see PCR ^ 
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Protocpls, Eds. Itrns, et al.^ Academic Press, Inc., 1990, incorporated herein by 
reference), 

A cDNA expression library, such as Agtl 1, can be screened indirecdy for hedgehog, N, 
or C polypeptides having at least one epitope, imng antibodies specific for hedgehog, N, 
5 or C. Such antibodies can be either polyclonally or monoclonaUy dmved and used to 
detect expression product indicative of the presence of the desired hedgehog cDNA. 

The polynucleotide sequence for hedgehog, N, or C, also includes sequences 
conq>lementary to the polynucleotide encoding hedgehog, N or C (antisense sequences). 
Antisense nucldc adds are DNA or KNA molecules that are complemratary to at least 

10 a portion of a specific mRNA molecule (Weintraub, Scientific American^ 2^:40, 1 990). 
The invention embraces aU antisense polynucleotides capable of inhibiting production of 
hedgehog, N, or C polypeptide. In the cell, the antisense nucleic acids hybridize to the 
corresponding mRNA, forming a doubles-stranded molecule. The antisense nucleic acids 
interfere with the translation of the mRNA since the cell will not translate a mRNA that 

15 is double-stranded. Antisense oligomers of about 1 S nucleotides are preferred, since they 
are easily synthe^zed and are less likely to cause problems than larger molecules when 
^ introduced into the target hedgehog, N, or C-producing cell. The use of antisense 
m^ods to inUbit the translation of genes is well known in the art (Marcus-Sakura, Anal, 
Biochem., 1^:289, 1988): Inhibition of target nucleotide would be desirable, for 

20 example, in inhibiting cdl-proliferative disorders, such as certain tumors, which are 
mediated by hedgehog, N or C. 

In addition, ribozyme nucleotide sequences for hedgehog, N or C are included in the 
invention. Ribozymes are RNA molecules possessing the ability to specifically deave 
other ^ngle-stranded RNA in a manner analogous to DNA restriction endonucleases. 
25 Through the modification of nucleotide sequences which encode these RNAs, it is 
posable to engineer molecules that recognize specific nucleotide sequences in an RNA 
molecule and cleave it (Cech, J. Amer. Med Assn., 260:3030, 1988). A major advantage 
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of this spproBch is that, because they are sequence-specific, only mRNAs with particular 
sequences are inactivated. 

There are two basic types of ribozymes namely, tetrahymena-type (Hasselhoff, Nature, 
2M:585, 1988) and "hammerhead'-^rpe. Tetrahymem-type riboqoncs recognize 
5 sequences vdudi are four bases in length, v^e"hamnierhead^ 

base sequences 1 1-18 bases in length. The logger the recognition sequence, the greater 
the likelihood that sequence will occur exclusivdy in the target mRNA species. Conse- 
quently, hanunerhead-type ribozymes are preferable to ietrahymem-type ribo;^es for 
inactivating a specific mRNA spedes and 18-based recognition sequences are preferable 
10 to shorter recognition sequences. 

DNA sequences mcoding hedgehog, N or C can be expressed in vitro by DNA transfer 
into a suitable host cdl "Host cells** are cells in which a vector can be propagated and 
its DNA expressed. The term also includes any progeny of the subject host cell. It is 
understood that all progeny may not be identical to the parental cell since there may be 
15 mutations that occur during replication. However, sudi progeny are included when the 
term "host cell" is used. Methods of stable transfer, meaning that the fbrdgn DNA is 
continuously maintained in the host, are known in the art. 

In Ae present invention, the hedgetog, N or C polynucleotide sequences may be inserted 
into a recombinant expression vector. The term "recombinant expresaon vector" refers 

20 to a plasmid, virus or other vdiicle known in the art that has been manipulated by 
insertion or incorporation of the hedgehog, N or C genetic sequmces. Such expression 
vectors contain a promoter sequence ^ch ftdlitates the efficient transcription of the 
inserted genetic sequence of the host. The expression vector typically contains an origin . 
of reglicatipn, a promoter, as wdl as specific genes which allow phenotypic selection of 

25 th^%ansformed cells. Vectors suitable for use in die present invention include, but are 
not limited to the T7-based expression vector for expression in bacteria (Rosenberg, et 
al,. Gene, i6: 125, 1987), the pMSXND ejq)rcssion vector for expression in mammalian 
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cdls (Lee and Nathans, 1 Biol Chem., 263:3521, 1988) and baculovinis-derived v^ors 
for expression in insect cells. The DNA segment can^ be present in the vector operably 
linked to regulatory elements, for example, a promoter (e.g., T7, metallothionein I, or 
polyhedrin promoters). 

Polynucleotide sequences encoding hedgehog, N or C can be expressed in either 
prokaiyotes or eukaryotes, although post-translational modification 6f eukaryotically 
derived polypeptides, such as cartxngdation, would occur in a eukaiyotic host. Hosts can 
indude microbial, yeast, insect and mammalian organisms. Mt^ods of expressing DNA 
sequences having eukaryotic or viral sequences in prokaryotes are well ktiown in the art. 
Biologically functional viral and plasmid DNA vectors capable of expression and 
replication in a host are known in the art. Such vectors are used to incorporate DNA 
sequences of the invention. 

Methods which are weD known to those skilled in the art can be used to construct 
expression vectors containing the hedgehog, N or C coding sequence and appropriate 
transcriptional/translational control signals. These methods include in vitro recombinant 
DNA techniques, synthetic techniques, and i/i Wvo recombination/genetic techniques. 
See, for example, the techniques described in Maniatis, ei al,^ f 989 Molecular Cloning 
A Laboratory Manual, Cold Spring Harbor Laboratory, KY! 

A variety of host-expression vector systems may be utilized to mpress the hedgehog, N 
or C coding sequence. These include but are not limited to microorganisms such as 
bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors contaimng the hedgehog, N or C codir^ sequence; yeast 
transformed with recombinant yeast expression vectors contmning the hedgehog, N or C 
coding sequence; plant ceU systems infected with recombinant virus eiqpression vectors 
(e.g:, cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (e.^., Ti plasniid) containing the Hedgehog, N 
or C coduig sequence; insect cell systems infected with recombinant virus e>q>ression 
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vectors (e.g:, baculovinis) containing the hedgehog, N or C coding sequence; of animal 
cell systems infected with recombinant virus e3q>Fession vectors retroviruses, 
adenovirus, vaccinia virus) containing the hedgehog, N or C coding sequence^ or 
transformed animal cell systems engineered for stable expression. 

Depending on the host/vector system utilized, any of a number of suitable transcription 
and translation elements, including constitutive and mducible promoters, transcription 
enhancer elements, transcription tmninators, etc., may be used in the expression vector 
(see e.g. , Bittw, et al , 1987, Methods in Enzymology, 1 53 : 5 1 6-544). For example, whai 
cloning in bacterial systems, inducible promoters such as pL of bacteriophage y , ^lao, 
ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. When cloning in 
mammalian cell systems, promoters derived from the genome of nuunmalian cells (e.g., 
metallothionein promoter) or from mammaUan viruses (e.g., the retrovirus long terminal 
repeat; the adenovirus late promoter; the vaccinia virus 7.SK promoter) may be used. 
Promoters produced by recombinant DNA or synthetic techniques may also be used to 
provide for transcription of the inserted hedgehog, N or C coding sequence; 

In bacterial systems a number of expression vectors may be advantageously Elected / 
dq>endi^g upon the use intended for the oq>ressed. For example, when large quantities 
of hedgehog, N or C are to be produced, vectors which direct the expression of t)igti 
levels of fusion protein products ^t arcreadSy purifiediiiay be desirable. Those ^ch 
are engineered to contain a cleavage ate.to aid in recovoing are preferred. Such vectors 
indude but are not limited to the £ colt €9q)ression vector pUR278 (Ruther, et al., EMBO 
J., 2:1791, 1983), in which the Hedgehog, N or C coding sequence may be ligated into 
the vector in fhune with the lac Z coding region so that a hybrid -lac Z protein is 
produced; pIN vectors (Inouye and Inouye, Nucleic Acids Res,, ll:31(Jl, 1985; Van 
Heeke and Schuster, J. BioL Chem. 264:5503, 1989) and the like. 



In yeast, a number of vectors containing constitutive r inducible promoters may be used. 
For a review see. Current Protocols in Molecular Bi 1 gy. Vol. 2, 1988, Ed. Aiisiibel, et 
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a/., Gnene Publish. Assoc. & WUey Interscietice. Ch. 13; Grant, et al., 1987, Expression 
and Secretion Vectors for Yeast, io Methods in Enzymology, Eds. Wu and Grossman, 
31987, Acad. Press, N.Y., Vol. 153, pp.5 16-544; Glover. 1986, DNA Cloning, Vol. % 
JBL Press, Wash., D.C., Ch. 3; and Bittw, 1987, Heterologous Gaie Ejq)r«ision in Yeist, 
s Methods in Enzjmiology, Eds. Boger and Kimmel. Acad. Press, N. Y , Vol. 1 52, pp. 673- 
684; and The Molecular Biology of the Yeast Saccharomyces, 1982, Eds. Strathon, el 
a/.. Cold Spring Haibor Press, Yds. land n. A constitutive yeast ivomoter such as ADH 
or LEU2 or an inducible promoter such as GAL may be used (Cloning in Yeast, Ch. 3, 
R. Rothstetn In: DNA Ooning Vol. 1 1, A Practical Approach, Ed. DM Glover, 1986. IRL 
10 Press. Wash., D.C.). Alternatively, vectors may be used which promote integration of 
foreign DNA sequences into the yeast chromosome. 

In cases where plant expressi(Hi vectors are used, the e}q[>ression of the hedgehog, N or 
C coding sequence may be driven by any of a number of promoters. For example, viral 
promoters such as the 3 SS RNA and 19S RNA promoters of CaMV (Brisson, et al., 

IS Natare, ilQ:5 1 1, 1984), or the coat protein promoter io TMV (Takamatsu. et al.. EMBO 
J., 6:307, 1987) may be used; altonatively, plant promoters sudi as the small subunit of 
RUBISCO (Coruzzi. et al., EMBO J., 1. 1671-1680, 1984; BrogUe, et al.. Science,/ 
^:838, 1984); or heat shock promoters, e.g„ soybean hspl7.5-E or hspl7.3-B (GurleyJ^ . 
c/a/., Mo£ Cc///Bw/.,6:559, 1986)maybeus^^ These constructs can be introduced 

20 into plant cdis using Ti fdasmids. Ri plaanids, plant vims vectors, direct- DNA 
tnmsfixmadon, nuooingection, dec^poration, etc. For reviews of such techniques see, 
for ocample, WeisAach and Weisd)ach, 1988, Methods for Plant Molecular Biology, 
Academic Press, NY, Section Vm, pp. 421-463; and Grierson and Corey, 1988, Plant 
Molecular Biology, 2d Ed., Blackie. London, Ch. 7-9. 

25 An alternative eiqiression ^stem which could be used to express is an insect ^eni. In - 
one sudi system, Autogn^ califomica midear poljiiedroas virus (AcNPV) is used as 
a vector tp express fordgn genes. The vinis grows in ^poefo/rlerayhr^pen/e? cdls. the 
hedgdiog, N or C coding sequence may be cloned into non-essential regions (for example 
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the polyhedrin gene) of the vinis and placed under control of an AcNPV promoter (for 
emaple the pol)^edrin promoter). Successfid insertion of the hedgehog. N or C coding 
sequence will result in inactivation of the polyhedrin gene and production of non- , 
occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded for by the 
polyhedrin gene). These recombinant viruses are then used to infisct Spodaptera 
fiuffperda cdls in wlridi the inserted gene b oq>ressed. (eg., see Smith, et al., J. Viol , 
46:584, 1983; Smith, U.S. Patent No. 4,215,051). 

Eukaryotic systems, and preferably mammalian ejq)ression qrst^ms, allow for proper 
post-translational modifications of expressed mammalian proteins to occur. Eukaryotic 
cdls which possess the cdlular machinoy for proper processing of the primary transcript, 
glycosylation, phosphorylation, and advantageously, secretion of the gene product may 
be used as host cells for tiie expression of hedgehog. N or C. Mammalian cell lines ra^ 
be preferable. Such host ceD lines nay inchide but are not limited to CHO. VERO. BHK, 
HcLa, COS. MDCK, -293, and WD8. 

Mammalian ceU systems whidi utilize recombinant viruses or viral elonents to direct " 
ejq)resfflan may be engineered. For ©cample, ^cn using admovirus expression vectorsr, 
the hedgehog, N or C coding sequence m^ be ligated to an adenovirus transcrq)tion/- 
translation control complex, e,g:, the late promoter and tripartite leader seqtiMce. This 
chimeric gene may then be inserted in the adenovinis genome by in vitro or in vivo 
recombination. Insertion in a non-essential re^on of the viral genome (e.g:, region El 
or E3) will result in a recombinant virus that is viable and capable of expressing the 
protem in infected hosts (e.^M see Ix)gan and Shenk,i>^ WW, 813655, 

1984). Altemativdy, the vaccinia virus 7.5K promote may be used. (e*g., see, Mackett, 
et al, Proc, Natl Acad ScL USA, 79:7415, 1982; Mackett, et aL, J. Wro/., 49: 857, 
1984;Pamcali,e/a/., Prvc. Natl. Acad Sci. 17X4, 22:492'?, 1982). Of particular interest 
are vectors based on bovine papilloma vims which iay|^e ability to replicate as 
crtrachiomosomal dements (Sarver, e/a/„ MoL Cell. fito/.. 1:486, 1981). Shortly after 
entiyof^DNAintomousecells,thepla5midreplicatestoabout lOOt 2d0copiesper 



1 

96/17924 



1 



1 



96/17924 PCTAJS9S/15463 

-26- 

ceD. Tnuiscription of the inserted cDNA does not require integration of the plasmid into 
the host's chromosome, thereby yielding a high level of expression. These vectors can 
be used for stable eq)res5ion by including a selectable marker m the plasmid^ such as, for 
example^ the bsi gene. Alternatively, the retroviral genome can be modified for use as 
a vector capable of introdudng and directing the expression of the hedgehog, N or C gene 
in host cells (Cone and Nfumgan,i^.Ar^^^ &I:6349, 1984). High level 

expresaon may also be achieved using inducible promoters, including, but not limited to, 
the metallothionine nA promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. Rather than uai^ eqmsaon vectors whidi contain viral origins of replication, 
host cdls can be transformed vnAi the hedgehog, N or C cDNA controlled by appropriate 
expression control elements (e.g., promoter, enhancer, sequences, transmption 
terminators, polyaden)dation sites, etc.), and a selectable marker. The selectable marker 
in the recombinant plasmid confers resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form foci v^^iich in turn can be 
cloned and expanded into cell lines. For example, following the introduction of fordgn 
DNA, engineered cells may be allowed to grow for 1-2 days in an enridied media, and 
thm are switched to a selective media. A numb^ of sdection systems may be used; : 
including but not limited to the herpes simplex virus thymidine kinase (Wigler^ et af„ 
Cell, 11: 223, 1977), hypoxanthine-guanine phosphoribosyltransferase (Szybalska and 
Szybalski, Proc. Natl Acad ScL USA, 48:2026, 1962), and admine phosphoribosyl- 
transferase (Lowy, et a/., Cell, 22.' 817, 1980) gmes can be mipioyed in tk', hgprt' or 
aprt' cdls respectively. Also, antimetabolite resistance can be used as the basis of 
selection for dhfr^ vAncYt confers resistance to methotrexate (Wigler, et aL, Natl Acad. 
ScL USA, 22: 3567, 1980; OHare, etai. Proa. Natl. Acad Sci. USA, 28: 1527, 1981); 
gpt, wMdi confers resistance to mycophenolic add (Mulligan and Berg, Proc. Natl. Acad 
Sci. USA, 28. 2072, 1981; neo, which confers resistance to the aminoglycoside G-4 18 
(Colberre^janvin, etal,, J. MoL Biol, 150:1, 1981); and hygro, which confers resistance 
to hygromydn (Santerre, et al. Gene, 30:147, 1984) genes. Recently, additional 
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selectable genes have been described, namely trpB, ^ch allows cells to utilize indole 
in place of tryptophan; hisD, which allows cells to utilize histinoi in place of histidine 
(Hartman and Mulligan, Proc. NatL Aaxi Set. 8S:8047, 1988); and CH>C (ornithine 
decarboxylase) whidi confers rie^stance to the ornithine decarboxylase inhibitor, 2- 
5 (d]fiuorQmetlQfl)-DL-omithine, DFMO (McConlogue L., 1987; In: Current Communica- 
tions in Molecular Biology, Cold Spring Harbor Laboratory ed,). 

Transformation of a host ceU with recombinant DNA nmy be carried out by conventional 
techniques as are well known to those skilled in the art. Where the host is prokaryotic, 
such as R coli^ competent cells which are capable of DNA uptake can be prepared from 
10 cells harvested after exponential growth phase and subsequently treated by the CaCl2 
method using procedures well known in the art. Alternatively, MgClj or RbCl can be 
used. Transformation can also be performed after forming a protoplast of the host cell 
if desired. 

Whoi the host is a eukaiyote, such m^ods of transfection of DNA as calcium phosphate 
15 co-precipitates, conventional mechanical procedures such as microinjection, electro- 
poration, insertion of a pla^d encased in liposomes, or virus vectors may be used^ 
Eukaiyodc cdls can also be cotransformed with DNA sequences encoding the hedgehog, 
N or C of the invention, and a second foreign DNA molecule encoding a selectable 
phenotype, such as the herpes sin:4>lex thymidine kinase gene. Another method is to use 
20 a eukaryotic viral vector, such as simian virus 40 (SV40) or bo^e papilloma virus, to 
transiently infect or transform eukaryotic cells and express the protein, (see for example, 
Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Ghizman ed., 1982). 

Isolation and purification of microbial expressed polypeptide, or fragrpeiit^ thereof, 
provided by the invration, may be carried out by conventional means including 
25 ' preparative chromatography and immunological sqiarations involving mpnocfonal or 
polyclonal antibodies. 
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The invention includes antibodies immunoreacti ve with or which bind to hedgehog, N or 
C polypeptide or functional finqgments thereof Antibody which consists essentiaUy of 
pooled monoclonal antibodies with different epitopic specifictttes, as well as distinct/ 
monoclonal antibody preparations are provided* Monoclonal antibodies are made from 
5 antigoi containing fragments of the protdn by methods well known to those skilled in the 
art (Kohler, et al. Nature, 256:495, 1975). The term antibody as used in this invention 
is meant to inchide intact molecules as well as fragments thereof, such as Fab and F(ab^)2, 
which are capable of binding an epitopic det^minant on hedgehog, N or C. The 
antibodies of the invention include antibodies which bind to the N or C polypeptide and 
to which bind with immunoreactive fragments N or C. 

The term "antibody** as used in this invention includes intact molecules as well as 
fragments thereof, such as Fab, F(ab*)2, and Fv ^^ch are capable of binding the epitopic 
determinant. These antibody fragments retain some ability to selectively bind with its 
antigen or receptor and are defined as follows: 

Fab, the fragment which contains a monovalent antigen-binding fragment of an 
antn)pdY molecule can be produced by digestion of whole antibody vdth the 
enzyme papain to yield an intact light chain aiui a portion of one heavy diain; 

Fab', the fragment of an antibody molecule can be obtained by treating whole 
antibody with pepsin, followed by reduction, to yield an intact light chain and a 
portion of the heavy chain; two Fab' fragments are obtained per antibody 
molecule; 

(Fab*>2, the fragment of the antibody that can be obtained by treating whole 
andbody with the en^e pepstii without subsequmt reduction; F(ab')2 is a dimer 
of two Fab' fragments held together two disulfide bonds; 



15 (1) 



(2) 



20 



(3) 



W09d/i7tt4 PCT/US9S/15463 

-29- 

(4) Fv, defined as a genetically engineered fragment containing the variable 
genetically fused single chain molecule. 

Methods of making these fragments are known in the art. (See for example, Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Hart>or Laboratory, New York 
s (1988), incorporated herein by reference). 

As used in this invention, the tenn "qiitope" means any antigenic determinant on an 
antigen to which the paratope of an antibody binds. Epitopic determiiumts usually consist 
of chemically active sur&ce groupings of molecules such as amino acids or sugar side 
chains and usually have specific three dimensional structural characteristics, as well as 
10 specific charge characteristics. 

Antibodies which bind to the hedgehog, N or C polypeptide of the invention can be 
prepared uang an intact polypeptide or firagments containing small peptides of interest 
as the immuniang antigen. The polypeptide sudi as N or C, or Augments thereof used 
to immunize an animal can be derived from translated cDNA or chemical synthesis vMch 
15 can be conjugated to a carrier protein, if desired. Such commonly used carriers wMich are 
diemically coupled to the peptide inchide keyhole limpet hemocyanin (KLH), 
tlQTOgJobulin, bovine serum albumin (BSA), and tetamis toxoid. The coupled peptide is 
then used to hnmunize the animal (e.g., a mouse, a rat, or a rabbit). 

If desired, polyclonal or monoclonal antibodies can be fiirther purified, for exanq>le, by 
20 binding to and elution from a matrix to which the polypeptide or a peptide to which the 
antibodies were raised is bound. Those of skill m the art will know of various techniques 
common in the inmmnology arts for purification and/or concoitration of polyclonal 
antibodies, as well as monoclonal antibodies (See for example, Coligan, et aL, Unit 9, 
Current Protocok in Immunol^, Wiley Int^^ience, 1991, incorporated by refiMrcnce) 
25 It is also pos^ble to use the anti-idiotype technology to produce monoclonal antibodies 
which mimic an epitope. F r example, an anti-idiotypic monoclonal aiitibody made to 
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a first monodonal antibody will have a binding domain in the hypervariabte region which 
is the "image" of the epitope bound by the first monoclonal antibody. 

Antibodies as described herein as having specificity for N polypeptide, e.g., Abl 
(residues 83-160), are use&l for specific identification of cells or tissues expressing the 
N fiagment of hedgehog. Simiterly, antibodies described herdn as having spedficity for 
C polypqydde, e.g.. Abl (residues 300-391X are usefid for specific identification of cells 
or tissues expressing the C fimgment of hedgdiog. Both antibodies, naturally, will also 
detect native hedgehog polypeptide. 

The N and C-spedfic antibodies of the invention are usefid for purification of N and C 
polypeptide, respectively, e^edally using the antibodies'immobilized on solid phase. 
By contacting a sample with anti-N antibody, both N and native hedgehog polypeptides 
can be isolated. By next contacting the sample removed by anti-N antibodies, with anti-C 
antibodies, the native hedgehog polypeptide is removed, thus aOowing purification of N 
polypeptide. In a similar manner, C polypeptide can be antibody purified fi^om a sample. 

Monodonal antibodies of the invention are suited for use, for exanq)le, in unmunoassays 
in which they can be utilized in liquid phase or bound to a solid phase carrier. In 
addition, the monoclonal antibodies in these inununoassays can be detectably labeled in ^ 
various ways. Example of types of immunoassays which can utilize monoclonal 
antibodies of the invention are competitive and nonHX>mpetitive immunoassays in either 
a direct or indirect fonnat. Examples of such immunoassays are the radioimmunoassay 
(RIA) and the sandwich (immunometric) assay. Detection of the antig^ using the 
monoclonal antibodies of the invention can be done utilizing immunoassays which are 
run in dther the forward, reverse, or simultaneous modes, including immunohistochemi- 
cal assays on physiological sanqiles. Those of skill in the art will know, or can readily 
discern, other immunoassay formats without undue e)q>erimentation. 
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The term '*immunoinetric assay" or "sandwich immunoassay % indudes simultaneous 
sandwich, forward sandwich and reverse sandwich immunoassays. These terms are well 
understood by those skilled in the art. Those of skill will also ^ipreciate that antibodies 
according to the present invention will be usefiil in other variations and forms of assays 
v^ch are presently known or which may be developed in the future. These are intended 
to be included within the scope of the present invention. 

Monoclonal antibodies can be bound to many different carriers and used to detect the 
presence of N or C polypeptide. Examples of well-known carries uiclude glass, 
potystyrene, polypropylene, polyethjiene, dextran, nylon, amylases, natural and modified 
celluloses, polyacryiamides, agaroses and magnetite. The nature of the carrier can be 
either soluble or insoluble for purposes of the invention. Those skilled in the art will 
know of other suitable carriers for binding nionoclonal antibodies, or will be able to 
ascertain such using routine experimentation. 

For purposes of the invention, N or C polypeptide may be detected by the monoclonal 
antibodies when present in biolo^cal fluids and tissues. Any sample contaimng a 
detectable amount ofN or C Can be used. A sample can be a liquid such as urine, saliva, 
cerebrospinal fluid, blood, serum and the like, or a solid or semi-solid such as tissues^ 
feces, and the like^ or, alternatively, a solid tisaie such as those commonly used in 
histological diagnosis. C polypeptide in particular is detectable in biological samples, 
since it tends to difiiise more readily than N polypeptide. 

In performing the assays it may be de^rable to include certain "blockers" in the 
incubation medium (usually added with the labels soluble antibody). The *1)lockers** are 
added to assure that non-specific proteins, proteases, or anti-heterophilic immuno- 
globulins to anti-C or N immunoglobulins present in the experimental sample do not 
cross-link or destr y the antibodies on the solid phase support, or the radiolabeled 
indicat r antibody, ta^eld &lse positive or &lse negative results. The selection of 
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*1>lock€rs** therefore add substantially to the specifictty of the assays described in the 
present mvention. 

The invention also provides a method for modulating proliferation or differentiation of 
neuronal cells comprising contacting the cells with a hedgehog polypeptide. The 
hedgehog polypeptide may be a native hedgehog polypeptide* or a N or C polypeptide, 
or functional fragments thereof Preferably, the modulation is induction of proliferation 
or differentiation of a particular cell type. This can involve either synergistic positive 
induction of nraronal cells by N, or negative modulation by ddta N-C for example (Lai, 
etal., Devehpment 121:2349, 1995). Ddta N-C enhances expression of postdier relative 
to anterior neural genes and does so through inhibition of N (see EXAMPLE 18 and 
Figure 18D). In addition to hedgehog polypeptide, a TGF-P factor may also be utilized 
in the method of the invention. 

Previous studies with the rat hedgehog gene showed that co*culture of cells e7q>ressii:^ 
rat hedgehog precursor gene, with explant from neural tube, was sufficient to induce 
formation of motor neurons and floor plate from the ocplant (Jessesl, T., and Dodd, J., In 
Cell-Cell Sigruding in Vertebrate Development (cd. E.J. Robertson, eial, pp 139.155, 
San Diego, Ca.), 1993). Therefore, based on the Examples herein^^o wing that hedgehog 
is expressed near the floorplate of the ventral midline of the neural tube and notCKshbrd, 
neuronal cells substantially derived from floor plate neuronal cdls can be induced by 
contacting the cells with hedgehog, N or C polypeptide. As used herein, the term 
**substantially derived", refers to those cells from the floor plate or proximate to the floor 
plate. For example, such cells include motor neurons and dopaminergic neurons. Those 
of skill in the art will be able to identify other noironal cdls substantially derived from 
the floor plate. Preferabfy the cells are vertebrate cells and most preferably, human cells. 

In addition, as desoibed herein in the Example, hedgehog, and particularly C fragmrat, 
induces the e>q>resaon of pituitary genes. Hedgehog is also eflfective in inducing anterior 
brain gene expression as exemplified by the OTX-A marker. Further, the addition of a 
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TCT-P fimiily member, for example activin, may be used to further induce expresaon of 
such genes. Other TGF-pfim%maxibers win be known to those of sk^ This 
i^arent synogy ofhh fragments with TGF-P fiunily members occurs through the TGF-p 
protein inducing expression of neural inducers such as noggin and fbllistatin. The hh 
fragment then synergizes with these inducers to pattern nairal gene expression. 

hh fingmmts may also be useful as nenre-sparing agents or in restoring or promoting 
appropriate patterning during the healing of major limb trauma. In addition, the N and 
C fragments may be useful in the area of genetic counseling. Specifically, femilial 
midline defects such as cyclopia, Polydactyly or neural tube defects may be diagnosed 
by mapping close to hh. Since autoproteolytic defects may be responsible for the 
disorders, N or C therapy could be provided. 

The invention also provides an autoproteolytic fiiaon protdn comprising a first 
polypeptide including the proteolytic domain of the C polypeptide of the invention, a 
cleavage ate recognized by the first polypeptide, and a second polypeptide (It is 
understood that the first and second polypq>tides can be reversed.) The auto-proteolytic 
activity of the native hedgehog protdn is found entirely within the C polypeptide, 
therefore, the C polypeptide is usefld for producing a fii^on polypeptide v*ich can then 
be deaved at the junction of the C polypeptide and the second polypq>tide.I^^ fiii^on 
protdn may optionally have a purification tag, such as a poly-histidihe tag for isolation 
on a nickd column, or an antibody epitope tag, preferably on the C fragment. The 
cleavage site includes the sequence "GCF", which is recognized by the proteolytic 
domain of the C polypeptide and is utilized to cleave the second polypeptide from the C 
fragment. Also included in the invention is a polynucleotide encoding the fusion protein 
of the invention 

. . .. ,^ ■ ' 

- .■ 

The invention also provides a method for produdi^ an autoproteoly^c fusion protdn 
comprisii^ perably linking a first polynucleotide, wherein the first "p lymidebtide. 
encodes a first polypeptide inchiding the proteolytic domain of the C polypeptide of the 
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invention and the cleavage site recognized by the proteolytic domain, and a second 
polynucleotide encoding a second polypeptide. As described above, the fiision protein 
may also include a carrier peptide and/or a purification tag. 

The C polypq)tide or functional fiagment thereof is useful as a fiision partner to cause 
5 lipophilic modification and tetherii^ of other protems in vivo or in vitro. Such fusion 
. proteins may be de^rable for &ctors whose activity is required in a localized manner, 
either by targeting DNA constructs to specific cells or by introducing cells traiisfected 
with spedfic DNA constructs, for example. It may be desirable to Itpid-modify a 
normally secreted protein in order to produce a cell-associated protdn. For example, it 
10 may be desirable to produce a viral antigen that remains cell associated. 

Alternatively, the C polypeptide or fiinctional firagments thereof can be used as a fijsion 
partner with a protein of interest (e.^.. Protein X fiised to hh-C domain). Such fusions 
form thioesters at the junction between Protein X and hh*C (via an S to N shift). The 
thioesters are then available as substrates for a peptide ligation reaction in ^^ch any 
15 peptide or protein haviiig an amino terminal cystdne (Peptide Y) is added and undergoes 
spontaneous rearrangement (S to N shift) that genmites a stable peptide bond between 
Protein X and Peptide Y (Protdn X-peptide bond-Peptide Y). For example, a protein 
that is toxic when produced in ^ivo could be produced in vitro using the hh-C domain 
fusion protein method. 

20 The fusion polypeptide may also include in optional carrier pq)tide. The "carrier 
peptide*', or signal sequmce, is located at the amino terminal end of the fu^on peptide 
sequence. In the case of eukaryotes, the carrier peptide is believed to function to 
transport the fusion pobl)q>tide across the endoplasmic reticulum The secrcftory protein 
is then transported through the Golgi apparatus, into secretoty vesicles arid into the 

25 extraceUular space r, prderably, the external environment. Carrier peptides which can 
be utilized according to the invention include pre-pro peptides which contain a proteolytic 
enzyme recognition site. Acceptable carrier peptides include the amino terminal pro- 
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region of calcitonin or other hormones, ^ch undergo deavage at the flanking dibasic 
sites. However, it should be noted that the invention is not limited to the use of any 
particular pqrtide as a carri^. Other carrier peptides are known to those skilled in the art 
or can be readily ascertained without undue experimentation. 

5 In one embodimmt of the invention, a carrier pq)tide which is a signal sequence is 
included in the expression vector, specifically located adjacent to the N-terminal end of 
the fusion polypeptide. This ^gnal sequence allows the fusion protein to be directed 
toward the endoplasmic reticulum. Typically, the signal sequence consists of a leader of 
from about 16 to about 29 amino adds, starting with two or three polar residues and 

10 continuing with a high content of hydrophobic amino acids, there is otherwise no 
detectable conservation of sequence known. Such signal sequen(^ are known to those 
of skill in the art, and iru:lude the naturally occurring signal sequence derived from a 
hedgehog protein. 

The fusion polypeptide of the invention indudes a polypeptide encoded by a structural 
15 gene, preferably at the amino-terminus of the fiision polypeptide. Any structural gene is 
e?qnessed in conjunction with the C-polypeptide (polynucleotide) and optionally a carrier 
peptide. The structural gene is operably linked with the carrier iri an expression vector 
so that tiie fusion polypeptide is pressed as a smgle unit. 

The identification of the autoproteolysis of hedgehog into the N and C domains is usefiil 
20 in a screening method to identify compounds or compositions which afiBect. this 
processing activhy. Thus, in another anbodiment, the invention provides a method for 
identifying a composition which affects hh procesm^ v^liich can be detennined by 
activity or gene expression, comprising incubating the components, which inchide the 
composition to be tested (eg., a drug, a small molecule* a protem) and a hh polypeptide 
25 or a recombinant cell expressing hedgehog or a gene encodii^ a C d main or fiinctional 
fragment thereof operably linked to an N domain or fimctional fragment thereof, under 
conditions suffident to allow the components to interact, then subsequently measuring 
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the effect the composition has on hedgehog activity or expression. Fragments of 
hedgehog polypeptide or polynucleotide can be used in the method of the invention as 
long as autoproteolytic activity remains (e.^., the construct exemplified in Figure 1 2a and 
I2b, Example 10). The obs&ved eflfect on hh may be either inhibitory or stimulatory. For 

5 example, one can detmnine whether the N domain is associated with the cell, or whether 
the N domain is secreted into the medium, in other words, whether incomplete processing 
has occurred. Such methods for determining the effect of the compound or composition 
on hh processing include those described herein (see Example 10, Figure I2a and I2b) 
such as time course of autoproteolytic cleavage or course of cleavage based on 

10 concentration ranges. Alternatively, the effect of the composition on hh can be 
detOTnined by the expression of anterior or postmor neural markers. Other methods for 
determining the eflfect of a composition on processing of N and C will be known to those 
of skill in the ait. Various labels can be used to detect the N and C domains, for example, 
a radioisotope, a fluorescent compound, a bioluminescent compound, a chemiluminescent 

15 compound, a metal chelator or an enzyme could be used. Those of ordinary skill in the 
art will know of other 5uitd)le labds or will be able to ascertain such, using routine 
CTqperimentation. 

As used herein, "hh activity" as described in the screening method refers preferably to 
autoproteolytic activity. However, it is undoistood, that one of skill in the art could use 
20 the above-described screening assay to identify a composition having an affect on other 
hh activities, for example, zinc hydrolase activity. Appropriate assays for determining the 
effect on such activities will be known to those of skill in the art. 

The following examples are intended to illustrate but not limit the invention. While they 
are typical of tiiose that might be used, other procedures known to those jskilled in the art 
25 may alternatively be used. 
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HEDGEHOG PROTEIN PROCESSING 

The full Imgth fonn of the hh proteiii (F) migrates vnth a mobility corresponding to a 
relative molecular mass of 46 kD. FIGURES 1 (A) and (C) are immunoblots with 

5 antibodies against amino- (Abl) and carboxy-terminal (Ab2) epitopes. GST fii^on 
protdns containing either residues 83 to 160 or 300 to 391 &om HH protein were 
expressed in Escherichia coli, purified as recommended [F. M. Aurabel» ei al. , Currerit 
PMocals in Molecular Biology (Greene and VWIey-Intcrscience, New York, 1 99 1 )], Md 
used to immunize rabbits by standard methods. The antibodies were affinity purified on 

10 a column of Ifise-U protein [E. Harlow and D. Lane, Antibodies: A Laboratory Manual 
(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1988)] linked to Affi-Gel 10 
beads (Bio-Rad). The purification was performed as described (Hariow and Lane, supra) 
except that the add and base elutions contained 10 percent dioxane. Biotinylated hh 
antibodies were prepared by purifying the rabbit antisera over a protein A column, 

15 followed by biotinylation with the use of ^e Immunoprobe biotinylation kit (Sigma). 
Immunoprecipitations were performed as described [Harlow and Lane] with the use of 
cold RIPA lyris buffisr containing 0.25 mM phenyteiethylsulfonyl fluoride (PMSF) «nd 
5 mM EDTA fer tissue homogenization. Lysates were precieared twice with pre-immune 
rabbit serum plus protein A beads (Gibco-BRL). Affinity-purified antibodies or 

20 preunmune serum was then added, and the immunoprectpitation was performed with 
protein A beads, with the use of NP-40 lysis buffer for the washes, 

Immunoblots were performed with affinity purified Abl or Ab2 by either of two 
chemiluminescence based protocols. In the first protocol (used in Figures 1, 3, and S) 
^ samples were resolved on 15 pm^ent or 12 percent SPS^olyacrylaniide gelf (jF.^Mv 
23. Ausubel et al., supra) and transferred to Magnagr^h nylon membim^ (MSI) by 
dectroblottii^. Blots were devdoped with the use of an alkal 
donkey anti-rabbit IgG secondary antibody and Lunu-Phos 530 (Bpehrii^er K^^ 
under recommended conditions. In the second protocol (used in HGURE |), samples 
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were transferred to nitroceUulose filters (Schleicher and Schuell), and blots were 
developed u«ng ECL reagents (Amersham) as recommended. The secondary antibody 
in this case was horseradish peroxidase conjugated goat anti-rabbit IgG (Jackson 
ImmunoResearch). Lanes contam protein from induced untransfected S2 cells (lanes 1 
5 and 13), transfected S2 cells induced to e)q>ress hh (hmes 2 and 14), imaginal discs (lanes 
3 and IS), wild type embryos (lanes 6 and 18), and in viiro translations of synthetic h 
mRNA both in the presence (lanes S and 17) and absence of microsomes (lanes 4 and 16). 

cDNAs encoding various hh protein species were cloned into the pMK33 vector, which 
allows for inducible expression under metallothiondn promoter control (Nf . R. Koelle et 

10 al.. Cell 67:59,1991). Stable S2 cell lines were made by transfection of the hh/pMK33 
plasmids with constant selection for hygromycin resistance. Proteins were expressed by 
plating a log phase culture of cells diluted to 0.1 A99, units, waiting 48 hours, inducing 
voth CuSO^ at 0.2 mM final concentration, and harvesting the cells and/or supernatant 24 
hours later. CeU samples for immunoblotting were made by adding 10 volumes of IX 

15 SDS VAGE loading bu£fer to pelleted ceQs. 

In vitro translations were performed with the use of the TNT coupled transcrip- 
tion-trandation system (Promega). methionine (DuPont NEN) was used for detection 
by autoradiography. In the heparin binding expenment in viiro translation lysate with 
microsomes that produce wild-type hh protein was added to heparin agarose (Sigma) or 

20 Sepharose CL-4B (Pharmacia) beads pre-equilibrated with heparin binding buffer (HBB; 
20mMTris(7.4X ISOmM NaCI, 0.1 percent Triton X- 100). Samples were incubated at 
4^ C for four hours with gotle rocking. After pelleting the beads, supematants in some 
samples were analyzed (lanes 2 and 4). The beads were then washed S timi^.>^th chilled 
HBB and samples (lanes 3 and S) were ^bsequently eluted at 80"" C for 10 minutes in 

25 SDS PAGE loading buffer (F. M Ausubeletal.,^ 5^^^ 

Embryos fix>m the wild-type Canton-S line and fipom the matings, hMt/hshh or hshh 
H329A/hsfc* H329A Xy, Sco/CyO. enlacZlI:n4^ (Kassis, et al , Proc. Nail Acad, Sci. 
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U.S.A. S9: 1919, 1992), were collected 0 to 16 hours after egg laying (AEL) at 25° C. 
They were heat shocked for 30 miiiutes at 37° C and aUowed to recover for 1 hour at 25° 
C. Embryos in HGURE 1 (Canton-S) were coUected 4 to 8 hours AEL at 25° C. In 
preparation for immunoblotting, aU embryos were dediorionated in 2.6 percent sodium 
hypochlorite and homogenized in 10 volumes of IX SDS PAGE loading buflfer. 

Multiple spedes were detected and minra- cross reactive bands are seen m most sanq)les 
including extracts of induced untransfected S2 ceHs (lanes 1 and 13). One of these bands 
(occurring in both panels) co-migrates with U (at 39 kD) and is particulariy abundant in 
lane6ofFIGUREl(A). 

FIGURES 1 (B) and (D) are blots of samples immunoprecipitated with Abl (B, lanes 
7-9), Ab2 (D, lanes 19-21), or pre-inunune serum (B, lanes 10-12 and D, lanes 22-24). 
Detection was with biotinylated derivatives of Abl (B) and Ab2 (D). Samples used were: 
induced untransfected S2 ccUs. lanes 7, 10, 19 and 22; transfected S2 cells induced to 
express hh, lanes 8, 11, 20 and 23; and embiyos, lanes 9, 12, 21 and 24. For either 
antibo^, Ml protein fiagmoits were q)edfically immunopredptatcd from hh expressing 
ceils and embryos, but not from untiwisfected cells. (E) In the schematic dia^am, 
cleavage ates are doioted by arrows. The cleavage ate mari^ed by the asteridc is inferred 
by identification of only one cleavage product and mqr ther^re occur at another 
location within the C fragmait. The first two columns to the right of the diagram indicate 
the reactivity of Abl and Ab2 to each hh fi:agment. The otiier columns indicate the 
presence (+) or absence (-) of each M fiagment in the various sanqiles. Paroitheses 
around F and indicate tiiat tiiese species are detected in in vitro translation reactions 
but not //I WW. • 

The 46kD species was detected bom in vitro trandation ©rtracts by. Abl and Ab2 
(FIGURE 1, lanes 4 and 16), and was partially converted to a spedes of 39 kD (U) when 
translation occurred in the presmce of nucrosomes (FIGURE 1, lanes 5 and 17). A 39 
kD spedes co-migrating witii U is also present in extracts fix)m all in vivo sourcies, but 
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none of these extracts contain detectable levels of F. U r^resents the signal-cleaved 
form of F; signal cleavage thus appears to be relatively inefficient in vitro, as reported 
previoiisly, (X J. Lee, e/o/., CeU, 21:33, 1992), but is highly efficient irt vivo To confirm 
that signal cleavage indeed is occurring at this unusual intennol location, a mutation that 
changes reddue Sg, to N at the predicted signal deavage site was mtroduoed. This 
mutation prevented convotion by microsomes of F to U and also produced a species that 
eomigrated with F upon transfection into cultured S2 cells. The effects of independently 
mutating the two methionine codons preset upstream of the signal sequence were also 
examined. In vitro translation of the sequence in which the first methionine is removed 
produces a protein species intermediate in mobility between F and U, and this species is 
converted to a q}ecics that coraigrates with U in the presence of microsomes or when 
produced in vivo. Alteration of the second methionine codon caused no change in the 
electrophoretic mobility of Hh protein produced in vivo or in vitro. 

Smaller species of /ft proteins fiiom in vivo sources have been reported previously (T. 
Tabata and T. B. Kombra^g, Cell 76: 89, i994). The latter study examined not 
endogenous proteins, but proteins induced to express at high levels fix>m exogenously 
introduced constructs. The antibody used did not distinguish epitopes fi^om distinct 
portions of the molecule. 

In addition to signal cleavage, a fiuther cleavage of the U precursor is responable for 
generating other forms of hh protein observed in vivo. This was deduced fi:om the 
diservation that Abl and Ab2 both detected the U (undeaved) spedes, but also interacted 
individually with mialler protdn spedes expressed endogenousty in embryos and 
imaginal discs or with spedes expressed upon introduction of the M geneinto S2 cells. 
Abl thus interacts with a 19kD species from all of these tissues (FIGURE 1^ lanes 2, 3, 
6, 8, 9), while Ab2 interacts with a 25 kD spedes and a 16 IcD spedes (FIGURE 1, lanes 
14, 15, 18, 20, 21). The 19 kD spedes hereafttt- is referred to as N (N-termirial fiigment), 
the 25 kD species as C (C-terminal fragment) and the 16 kD spedes as C*; these spedes 
represent the major forms of endogenous hh protem present in vivo. 
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The proposed cleavages by whidi these species arise are shown schematically in the 
bottom portion of FIGURE 1 . The N and C species are uniquely detected by Abl and 
Ab2, respectively, and the sum of the relative masses of the two smaller species is 
roughty equivalent to the rdative maiss of U. The dectrophoretic mobilities of the F and 
5 U species are somewhat at variance with thdr predicted relative masses (S2. 1 kD and 
43.3 kD, respectively). The identities of these species w^e confirmed by in vitro 
translation of a variety of hedgehog open reading fiames modified to contain different 
extents of sequence at the NHj- or COOH- terminus, and by insertion of epitope tags. 
The migration anomalies appear to be associated with protein ^ectes in which sequences 
10 fi-om both the NHs- and COOH-terminal Augments are simultaneously present. The 
mobilities of the NH^- and COOH-terminal fragments, in contrast, correspond to relative 
masses (19 kD and 25 kD, respectively) that sum to yield 44 kD, roughly equivalent to 
the expected relative mass of U. 

A sin4>le mechanism that could account for the derivation of the two smaller ^edes 
15 therefore would be a single internal cleavage of the U precursor. Processing of the hh 
protein when translated in vitro also yields a 25 kD species (C; laiws 16 and 17) and 
eitf)era29kDor 19kD(^Q^edesOanes4and5). The 19 kD species comigrates with 
N, and its formation depends upon the presence of microsomes, consistent with the 
proposal that N derives fiom F by ^gnal cleavage and a fimher internal cleavage. The 
20 overall pathway fi>r formation of the predominant forms of hh protein observed in vivo 
thus appears to involve signal deavage of F to generate U. U is then cleaved internally 
to form N and C, which are the predominant forms found in vivo. Further processing of 
the 25 kD C species might then gen^ate the 16 kD C* species, but whether this 
processng is a angle deavage event or not is not clear since Ab2 does not recognize the 
25 smaller 9 kD firagment that would result. The processing of C to generate G* appears to 
occur with greater effidency in imaginal discs as conq)flii^^^fo (compare lanes 

15 and 18); this may be caused by the more extended mass isolation procedure of 
ima^nal discs (O. M. Eugene, etal,. Tissue Culture Assn. Man., 1: 1055, 1979). 
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EXAMPLE2 

AUTO-PROT EOLYSIS OF THE HEDGEHOG PROTEIN 

The ccnnigrBtion of endogenous and in vitro-gmeraied hh protean species suggested that 
in vitro processing is amilar to that observed in vivo. FIGURE 2 shows limited sequenc 
amilarity between hh proteins and serine proteinases, hh proton sequences are aligned 
to residues 323 to 329 of the D. melanogaster protdn and mnnbered as positions 1 to 7 
(group A). Consoved hh residues are in bold letters. The catalytic histidines (A J : > 
Barrett, m Proteinase inhibitors k. J. Barrett, G. Sahresen, Eds. (Elsevier, Amsteidam, 
1 986) pp. 3-22) of mamnialian serine protdnases (group B) are aligned to the invariant 
Wstidine at poation TmHh protons. Abbreviations are as foUows: C-Shh, chidcen 5<»i/c 
/* (R. D. Riddle, etal, Celll^. 1401, 1993); M-S«r, mouse Sonic hh (Y. Echelard et ai, 
CeU 2S: 1417, 1993) (identical to flZig-1; R vM-I, rat vhh-\ (H. Roelink et al., Ce//2fi: 
761, 1994); Z-Shh, zthraMi Sonic hh(S. Krauss, etal, Cell 25: 1431, 1993) (idoitical 
to shh) and zebrafish \hh-l, (H. Roelink et al.. su/n-a); twhh, no otho- abbreviation; 
M-DM, mouse Dexrt Ml (Y. Echdarci et al., Ce//25: 1417, 1993); M-Uth, mouse Indian 
hh (Y. Echelard et al., supra); CHT, bovine chymotiypsin; TRP, bovine trypsin; ELA, 
porcine elasteoe; UKH, human urokinase; CIR. human complement fector IR; CIS, 
human complement fector 1 S; MCP, rat mast cell protease; FAX, human blood clotting 
fector X; TPA, human tissue plasminogen activator. 

Hgure 2 shows that a seven residue region athh coding sequoice (residues 323 to 329 
in the Drosophila proton) di^lays some similarity to the sequences of serine proteases. 
Hus region lies approxinutely two thirds of the distance fiom the signal cleavage site to 
the carboxy-termmus, and includes Thr and I£s, reddues (positions 4 and 7 in FIGURE 
2) tfiat are invariant amopg all hh sequences fiom all spoaes. In the serine proteases, this 
conserved sequence contains an invariant His that acts as a general base in catalysis (A 
J. Barrett, in Proteinase inhibitors A J. Barrett, G. Salvesen, Eds. Elsevier, Amsterdam, 
1986, pp. 3-22). - 
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To determine Aether this invariant His residue in the A/r protein indeed plays a role in 
auto-proteolysis, two proteuis fiiom E. coli were purified: one carried the wild type 
sequence and the other a substitution of an Ala codon for the His codon at position 329 
(H329A). Both of these proteins were engineered to contain a hexa-histidine tag at the 
s amino terminus fused to Drosopkila sequences extending fiom a residue just before the 
signal cleavage site to the carboxy-terminus (residues 83 to 471; the wild type foitn of 
this protein is referred to as Ifis«-U); Both proteins w^e extensively purified under 
denaturing conditions using a Ni^ -chelating matrix. FIGURE 3(A) is a cobmasie blue 
stained polyaaylamide gel that shows the production and purification of HisrU and 

to His6-UHa29A proteins fi-om K coli. Samples were molecular weight markers (lanes 1 and 
2); lysates of £ coli cells carryiqg the FfiSs-U expression construct without (lane 3) and 
with (lane 4) induction by IPTG; purified Hi^-U protein (lane 5); lysates of K coli cells 
that cany the HisrUttm expression constnict without (lane 6) and vnth (lane 7) induction 
by IPTG; purified His«-UH»tA protein (lane 8), Purified proteins were essentially 

15 homogeneous except for several minor species of lower relative mass; these species are 
endogenous breakdown products of the fitlt-length proteins since they %vere absent in 
uninduced extracts and were detectable with hh antibodies. FIGURE 3 (B) is an 
immunoblot detected witii Ab2 showing transected S2 cells induced to e>q>ress hh (lane 
1 ); HivU and HtSrUiDin^ proteins incttbated in cleavage reaction buffer for 0 hours (lanes 

20 2 and 5), for 20 hours (lanes 3 and 6), and for 20 hours in the presence of 20 mM TAME 
(a serine protease inhibitor) (lanes 4 and 7), Upon incubation the His«-U, but not the 
HiS(-Uii)29A protdn, released a fi^agment presumed to be C on die basis of reactivity with 
Ab2 and co-migration wi^L C produced in S2 cells. Release of C (lane 3) was only 
partially inhibited by TAME. 

25 Preliminary proteinase inhibitor studies have been performed on in vitro translated Hh 
protein by adding various inhibitors at the start of the translation reason. These studies 
have been complicated by the fict that numerous protease inhibitors lower or block 
translation eflSciency. In some cases the effectiveness of an inhibitor was assayed by 
determining if addition of an inhibitor to a completed translation reaction will inhibit the 
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self-procesang that normally continues to occur. At this time we can only state the 
following with certainty: (i) the serine protease inhibitor TAME 
(p-tohienesulfonyl-L-ai^inine methyl ester) inhibits auto-proteolysis of in-vitro translated 
Hh protem; (ii) soybean trypan inhibitor, a, anti-tiypsin, {4>rotitin, laipq}tui, and E-64 
do not block auto-proteolysis of translated Hh protein; and (iii) TAME partially inhibits 
auto-proteolysis of purified HivU protein (FIGURE 3, panel B). 

As seen in FIGURE 3B, upon dilution of denaturant the wild type prot^ but not flie 
H329A mutant protein released a 25 kD species detectable by Ab2 and identical in 
mobility with the C species produced from in vriro translations and various in vivo 
sources. This cleavage was also observed vrtiai the wild type protein was purified and 
renatured by other protocols and cleaved under distinct conditions. Plasmids encoding 
ftc HisrU and His6-l^A pxrtans were generated by inserting sequences corresponding 
to residues 83 to 471 fiom the wUd-type or Mi H329A ORF into the pRSETB expression 
vector (Invitrogen). Proteins were induced in BL21 (DE3)/pLysS E. coli cells as described 
(F. M. Ausubel et al , supra). The basic purification was pofonned on Ni-NTA ^arose 
beads (Qiagen) by a denaturing protocol with the use of 6 M guanidimiim HCl and 8 M 
urea essentially as recommaided (a detailed protocol of exact conditions used is a>iilable 
upon request). Washes contained 0.2 percent Tween 20 and 5 niM b-mercapt6ethanol 
The final wash buflFer was: 6 M urea. 100 mM Tris, 500 rtiM NaCl, 20 percent glycerol. 
(pH 7.4). Ehitkms were with the final wash buffer containiog 250 mM imidazole. In vitro 
deavage reactbns were performed by incubating the purified protein (diluted 1:30 in the 
final nrix) in deavage buflfer [50 mM Tris, 500 mM NaCl, 5 percent glycerol, 0.2% Triton 
X-IOO. 50 mM DTT, (pH 7.4)]. To isolate soluble fiill-length HivU protein free from 
denaturants or detergents, additional steps were taken (this refers to the other renaturation 
protocols mentioned in the text). Full-length protein from the eluate described above was 
fiirther punfied fix)m breakdown products by precipitation, V urea removal through 
dialysis. The predpitate was then re-solubilized m a buffer containing guanidinium HCI 
and loaded onto another Ni-NTA agarose cohmui. After washing as described, the protein » 
was re-fold(id (whUe attached to the beads) by gradual dUution of urea (from 6M to 0 .5M) . ^ 
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whh dihition buflfer ((100 mM Tris, 500 niM NaCl, 20 percent glycerol, (pH 7.4)] over 
an 8 hour period at 4" C. The protein was duted with dilution buflfer containing 250 mM 
imidazole and 0.5M urea. The ehiate was dialyzed in 100 mM Tris, 150 mM NaCl, 10 
percent glycerol, (pH 7.4) at 4° C and stored at -70° C. 

5 MArni^y 

MAPPING THE Af ITO-PROT EOI.YTIC FUNCTIONS Of hh 

To more predsdy define the domain of the hh protein respon»ble for this auto- 
proteofytic event, the ^fects of several distinct types of mutations upon in vitro 
prooesang were examined. The most informative mutation was a ddetion that removes 
10 residues 89 to 254 (A89-254), which t(%eth«r constitute most of the amino acids vwthin 
the portion of the molecule preauned to form the N fragment. In vitro translations of 
vwld-type and mutant Hh proteins from Drosophila (FIGURES 4 A-C) and zebrafish 
(HGURE 4D) are showii The locations of mutations and cleavage sites (arrows) in these 

pmtaas are sdiematicalty iUustrated (FIGURE 4E). In the Droi$«ipAito protean (FIGURES 
4A, B, and C),auto-proteolysis is blocked or severely inhibited by several mu 
the COOH-tenninus (H329A, 294 trunc, 410 tnmc, flu408 add 456 tiunc), but is 
unafibcted by a large deletion (A89-254) or insertion of a flu-tag epitope trimer (flu227) 
in the NHrtwininus. Auto-proteolyas thus depends primarily on residues within the C 
fragment (sequences to the ri^ of the cleavage ate in the diagram below, see FIGURE 

20 1). Furthermore, the H329A/flu227 double mutant is not deaved by wild-type protein in 
a mixing experiment (lane 11), suggesting an inttamolecutar mechanism for auto- 
proteolysis. Hh proteins encoded by the zebrafish genes twhh and shh display a patton 
of processing (D) nmilar to that of the Drosophila protein ahhougti Ae NH,-tenninal 
fragment of each zebrafish protein (23 m fair twhh atid 22 kb for ^M) has a lower 

25 apparent mass than the COOH-terminal fragment (25 kD ior twhh and ^h} This is the 
resuk of a shorter stretch of readues that precedes die signal sequence as coinpared to 
the Drosophila protein. Processing is blocked by H273 A and H270A mutations in twhh 
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and shh protdns respectively (analogous to the H329A mutation in the Drosophila 
protein^ vMdlx suggests an auto-proteolytic processing mechanism is used similar to that 
observed for the Drosophila proton. 

In vitro translations were performed with the use of the TNT coupled 
5 transcription-translation system (Promega). '^S methionine (DuPont MEN) was used for 
detection by autoradiognq^hy/ In the*heparin binding e7q)enment (FIGURE SC), in vitro 
translation lysate with microsomes that produce wild-type Hh protein was added to 
heparin agarose (Sigma) or Sepharose CL-4B (Pharmacia) beads pre-equilibrated with 
heparin binding buff^ (HBB; 20 mM Tris (7.4), ISO mM NaCl, 0.1 percent Triton 
10 X-100). Samples were incubated at 4*" C for four hours with gentle rocking. After 
pelleting the beads, supematants in some sanities were analyzed (lanes 2 and 4). The 
beads were then washed S times with chilled HBB and samples (lanes 3 and 5) were 
subsequently duted at 80° C for 10 minutes in SDS PAOE loading buflSer (F M. Ausubel 
et al., supra). 

15 All mutations in the hh gene were generated in the pla^d pF 1 (J. J. Lee, et al , supra). 
Mutations in the zebrafish /M^Aft and genes were generated with the origuial cDNA 
dones as described (Ekker, et aL, Current Bfology, KS): 944,1995). All point mutations 
were generated with the use of reconibinant circle PCR (D. H. Jones and S: C. 
Winistorfer, Biotedmiques 12: 528, 1992). The flu408 and flu227 mutations were 

20 generated by inserting a trimer of the influenza hemagglutinin antigen (42 residues for 
flu408 and 43 residues for flu227) into the AlwN I and Bgl I sites present in the hh ORF 
(nucleotide portions 1604 and 1058 respectively) (J. J. Lee, et al, supra). The A89-254 
mutation was generated by ranoving sequences between the EcoN I site (644) and the 
Pml I ate (1 145). The 294 tninc muUttion was generatedj)y removing sequences between 

25 the Acc I site (1265) and the Xcm I site (1792). The 4 10 trunc mutation was previously 
genoated and idaitified as Hh^io (J. J. Lee, et a/., supra). To map the mutation in the hh 
allele (base change C^m to A; coding change Tyis? to STOP), DNA isolated from 
Mi^UMi was used to seed PCR reactions generating regions of the hh ORF and flanking 
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sequences, which were subcloned into Bhiescript KSM (Stratagene). Six clones each, 
derived from two different PCR anu>lifications were sequenced. 

As seen in lanes 1 and 2 of FIGURE 4A, this construct generates a fiiU length species of 
a mobility corresponding to the expected relative mass of 33 kD, and two cleaved 

5 products vi*OM apparent reUtive masses (25 and 9 WD) sum to give 

the larger species. The smalls of the cleaved products will occasionaUy migrate as two 
bands as seen in Fig 4A. We have chosm Ae lower of the two bands between the 14. 3-kp' 
and 6.2-kD markers for our molecular weight measurement. The larger of the two 
cleaved products comigrates wiA the C species produced from the wUd type protein, 

10 suggesting that the A89-254 hh protein contains the residues normally present in C and 
all of the determinants required for auto-protcolysis, including the normal cleavage site; 
most of the r^dues within N are dispensable for auto-proteolytic activity. 

In contrast, lesions affecting residues presumed to He withm C blodc auto-proteolysis in 
vitro. All mutations tested by in vitro translation were also examined in S2 cells by 

15 immunoblottmg In aW cases the patterns of cleavage in S2 cells were identical to those 
observed in tran^ons except that C* was ahvays preswit whoever C was formed. The 
former fragnietit was not observed in translations. These include the H329A mutation 
described above, a mutation that inserts an influenza virus epitopic between residues 408 
and 409 (flu408), and three mutations that cause premature terminatiofi of the protein at 

20 the carboxy termirms. The two most severe truncations, 294 trunc and 410 trxmc, are 
mutations generated wiw/lra. Theycausealossof 177and61 residues, respectively, from 
the carboxyl-terminus of the protein, and ncather undergoes proteolysis. The 456 trunc 
hh protein is like that encoded by the EMS-induced hh^^ mutant allele, which results in 
the loss of 15 re^dues from the carboxy-terminus of the protein. This prota^ VP^g^^s 

» ^ auto-proteolysis, as demonstrated by the appearance of a 24 kD band in place of C, ^ 
the efficiency of the reaction is much impaired in vitro (nOURE 4B). Autp'-proteolysis 
of the Mt protein relies mainly iq)on residues withm C; deletion or aheratipn of residues 
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within this domain is associated with reduced effidency of processing, and one such 
deletion appears to be the cause of the M"^ mutation. 

The sequence homology and auto-proteolytic function of the full length hh protein 
suggested the possibility that F or the C fragment is a sequence-specific protease. As a 
5 first step in clarifying the mechanism of auto-proteoly^s, an influenza virus q>itope tag 
was introduced into the N-terminus of a Aft open readmg frame that also carried a H329A 
mutation. FIGURE AC shows that the ins^on of the epitope tag alone doies not interfere 
with auto-proteolysis (lane 9), and yields a ndrmal C fragment and an N fragment of 
increased relative mass (compare to wild type in lane 12). The protein carrying^both 

to mutations does not undergo proteolysis (lane 10), and since the epitope-t^cd N 
fragment migrates differently from N, this double mutant provides an ideal substrate to 
look for intomoiecular cleavage upon nuxture with a wOd type sequence. Lane 1 1 shows 
that in sudi a mixture, ahbough normal N is formed, no tagged N can be detected. Thus, 
in this experiment, no appredable intermolecular cleavage occurs. We also &iled to 

15 detect intermolecular cleavage in the following two experiments: (i) co-transfection of 
wild type and 410 trunc sequences into S2 cells (the cleaved 410 trunc protein would 
yield a smaller and tho^efore identifiable form of C); (ii) mixii^g of excess uidatelled^ 
purified HiSfi-U protein with labelled, in vitro translated H329A mutant protein. Thus, 
although an intermolecular mechanism for regiilation of auto-pipteolysi^ or for cteavage . 

20 of other proteins can not be ruled out, the current evidence suggests that cleavage of the 
hh protein occurs predominantly by an intramolecular mechanism 

The hh gene has been broadly conserved in evolution, with single homologues 
unidentified in a wide variety of invertebrate species and multiple distinct homologues 
in each of several vertebrate species (Y. Echelard ct al.. Cell 7^: 1417, 1993; S; Krauss, 
25 e/ a/. Cell Tg: 143 1, 1993; H. Roelink et al., Cell, supral As seen in FIGUEUE 2, aU of 
these coding sequences contain an invariant histidine and other conserved residues at a 
position corresponding to H329 in the Drosophila protein. In addition, the protein 
encoded by at least one of the mouse genes appears to be processed w ww to yield two 
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smaQer species in a manner resembling the in vivo processing of the Drqsophila protein. 
To determine whether auto-proteoiysis may also play a role in vertebrates we examined 
the behavior of proteins encoded by two distinct hh homologues from the zebrafish, twhh 
and sMi. FIGURE 4D dononstrates that \siien these sequences are translated in vitro, 
5 smaller spedes are generated whose rdative masses sum to yield approximately the 
relative mass of the fijll length protdn (lanes 1 and 3). As seen in lanes 2 and 4, this 
cleavage reaction is blodced by substitution of Ala codons for the His codons at positions 
corresponding to H329 in Drosophila (see FIGURE 2). Vertebrate /rA proteins thus 
appear to be processed by a ^milar mechanism as the />ro5qpA/£3 prot 

to P3^MPy.E4 

ROLE OF AUTO-PROTEOLYSIS IN EMBRYOS 

Numerous fiinctions for the hh gene have been described in Drosophila. At the 
morphological level these include a role in patterning of larval cuticular structures and 
adult structures such as the eye and appendages (C. Nusslein-Volhard and E. Wieschaus, 

15 Miftir«2S2: 79S;i980; and I Mohler, ^petics I2£l: 1061, 1988).; the mechanistic basis 
for thee morpholQ^cal eflfects involves agnaling for maintmance or induction of g&ie 
. e)q[>ression m embryos and imaginal discs (J. J. Lee/iigira; T. Tabata and T. B. Kombefg, 
Cell 16: 89, 1994; and K. Easier and G Stnihl,>irwr<r 2fiS: 208, 1994). To ascertain the 
importance of auto-fvoteolysis fixr these fimctions, the H329A mutant gene under control 

20 of the hsp 70 promote was introduced by P demmt-mediated transformation into the . 
Dro50/?/i//a germlina The hsWi H329A construct was made identically to the hsA/i 
construct with the use of a hh ORF fragment containiiig the H329 A mutation; Transgmic 
flies we»^ generated from ay n^^^parental strain usitig standard m^ods of P element 
mediated transformation (A. C. Spradling and G. M. Rubin, Science 218: 341 1982). A 

25 line, HA3, carrying the hsM H329A P elemmt on the second chromosome was 
maintamed as a homozygous stock. To assay for expansion of stripes, embryos, 
oollectedat4to6hoursafter gg laying (AEL) at 25 " C were subjected to the following . 
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heat shock {votoools prior to fixation. Embiyos receiving atigle diocks (10 or 30 minutes 
at 37" C) vmt aOowed to recover for 1 hour at 25" C. Embiyos receiving double shocks 
(two 10 munite or two 30 minute shocks at 37° C) were allowed to recover 90 minutes 
after the first shock and 40 minutes after the second (Both recoveries were at 25 ' C. The 
double 30 minute protocol was as previously described, (S. Krauss, sigjra). In situ 
hybridizations were performed as described (D. Tautz, Chromosomal. 81, 1989) using 
a wg specific probe (D. T. Chang et al., stqna). EmKiyos assayed for cuticle phenotype 
were heat shocked 6 to 8 hours AEL for 30 minutes at 37" C, aUOwed tO develop at 25 ° 
C for 36 hours and then processed and mounted as described (M. Ashbumer, Drosophila: 
A Laboratory Manual. Cold Spring Harbor Uboratory Press, New Yoric, 1989). 
Immunolocalizations (single or double stains) were performed as described. With the use 
of affinity purified Abl or Ab2 for the primaiy antibody and alkaline phosphatase (AP) 
or horseradish peroxidase (HRP) conjugated anti rabbit or mouse IgG (Jackson 
Immunoresearch) for the secondary. Embiyos fix>m a AA*^/TM3 /tt-lacZ (the balancer 
chromosome was fi^om the Bloomington Stock Center, strain 3218) stock homozygous 
for the hh"^ allele were identified by the tack of stainmg with an anti b-galactosidase 
antibody (Promega) in a double stain with Ab2 (FIGURE 9. panel D). Staining in 
FIGURE 9, panels B and C were performed formaldehyde fixed Canton-S embryos with 
the use of an AP conjugated anti-rabbit IgG secondary. Ahhough standard formaldehyde 
fixation >fns generally used, heat and acid-formaldehyde fixation also gave simitar 
results. GST fusion proteins containing either residues 83 to 160 or 300 to 391 from the 
Hh protein were expressed in£: coli, purified as recommended (F. M. Ausubel et al., 
supra), and used to immunize rabbits by standard methods. The antibodies were affinity 
purified on a column of HisrU protein (Harlow and Lane, siq>ra) linked to Affi-Gel 10 
beads (Bio-Rad). The purification was performed as described (Hariow and Lane, sw^jS). 
except that the add and base elutions contamed 10 percent dioxane. Biotinytated M 
antibodies were prepared by purifying the rabbit antisera over a protdn A column, 
foUowed by biotinylation with the use of the Immunoprobe biotinylation kit (Sigma)^ 
Immunopredpitations were performed as described (Harlow and Lane, supra) with the 
use f cold RIPA lysis buflfer containing 0.25 mM PMSF and 5 mM EDTA for tissue 
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homogeiuzation. Lysates were precleared twice with pre-immune rabbit serum plus 
protein A beads (Gibco BRL). Aflfinity purified antibodies or pre-immuiie serum was then 
added, and the immunopredpitation was performed with protdn A beads, with the use 
of NP-40 lysis bufi&r for the washes. 

s FIGURE 5 (A) and (B) are immunoblots developed with the use of Abl and Ab2 
antibodies reqjectively. Lanes 1 and 6, induced untransfected S2 cdls; lanes 2 and 7. 
transfected S2 cells induced to express hh; lanes 3 and 8, heat shocked wild-type 
embiyos; lanes 4 and 9, heat shocked bshh embiyos; lanes 5 and ^0, heat shocked hshh 
H329A embryos. In heat shocked hshh embiyos, the wild-type Hh protein is both induced 

to and properly processed to goierate the U, N C and C* spedes seen in other expression 
contexts. In ccmtiast, Ae H329A is induced but not q>preciably processed in hsAA H329A 
embryos (the low levels of processed ^ecies in lanes 5 and 10 are probably from 
endogenous hh eiqjresnon smce th^ are seen at identical levds in heat shocked 
wild-type embryos in lanes 3 and 8). 

15 FIGURE 5 shows that heat shock induction resuhs in the formation of an abundant 
spedes that corresponds to'U based on its molntity and its interaction with Abl and Ab2 
Oanes 5 and 10). In contrast, induction of wild type M prqtdn usng a amilar contnict 
resulted in similar levds of the N and C processed products Oanes 4 and 9), with very 
little uncleaved U. Thus, as observed in vitro and in S2 cdls, theH329A mutation in 

20 embryos appears to greatly reduce the eflBdency of auto-proteotytic cleavage of the hh 
protein. 

In FIGURE 6, the embryonic distribution of wingless (h;^) RNA as reveded by in dtu 
hybridization is shown in FIGURE 6 (A) wild-type (homozygousy w""), (B) hs^. and 
(C) hs** H329 A embryos that were exposed to two 10 minute heat dipcks separated by 
25 a 90-nunute recovery pmod. Wild-type embryos showed littie change in wg ocpressibn, 
whereas the wild-type protdn and, to a lessw exteat, the H329A protein eadi induced 
ectopic wg expression (Table 1). Panels (D), (E), and (F) show the dorsal surfaces of/ 
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w"'*, hshh, and hshh H329A larvae, respectively, at the level of the fourth abdominal 
se^gment. These larvae were shocked for 30 minutes as embryos and allowed to complete 
embiyogcnesis. Cuticle cell types (r, 2", 3% and 4**) are labeled as described <J. 
Heemskerk and S. DiNardo, Cell 2^: 449, 1994). Note the expansion of 2° cell types 
(naked cuticle) at the e:q)ense of 3' and some 4** types in the hshh embryo (E) under 
conditions vAiere the phenotype of hsWr H329A embryos (F) is identical to that of control 
eihbryos (D). 

Perhaps the earliest known requirement for Hh protein is in mamtenance of an adjacent 
stripe of wireless (wg) gene expres^on in each embryonic segment (A. Martinei Arias, 
eioL, Development m:^ 157, 1988; and S. DiNardo. eial.. Nature 332: 604. 1988). This 
requiremem is deduced from the loss of wg expression when hh function is absent; in 
addition, the ubiquitous expression of wild-type Hh protein induces expansion of the 
domain of wg gene expression (P. W. Ingham, Nature 2§6. 560, 1993). The effects of the 
H329A mutation upon wg e7q)ansion were examined by heat shocking embiyos carrying 
the H329A mutani: construct in parallel with embryos containing the wild-type construct. 
Aldiougjh the H329A mutant protein is able to induce some expansion of the wg domain, 
the effidency of this activity is impaired relative to that of the wild-type protein 
(FIGURE 6, B and G; Table 1). The difference in efiSdency ranges nearly as high as 
threefold depending upon the heat shock regime, and these results suggest that a 
uto-proteolysts of the Hh protein is inq[>ortant for optimal activity in embryonic signaling 
to induce expression. 
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JASLLl 

Wild-type and mutant hh acthritv in embryonic inductinn of wp Mpression' 

minutes of heat shock 
10 30 10/10 30/30 

hshh 1.0 ±0.3 (93) l.S±0.6(120) 2.9 ±0.3 (41) 2.8 ±0.4 (54) 

hsMrH329A 0.7±0.5(190) 0.9±0.4(111) 1.1±0.4(145) 1.9±0.5(93) 

* E7q)an^on of wg expression beyond wild*type controls is given as average number of 
cell diameters ± standard deviation with niunber of onbryos scored in paroitheses. 
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The eSBscts of Hh protein on the patteming of cuticular structures are most clearly visible 
on the dorsal sur&ce of the larva, where four distinctive ceU types can be identified in 
each parasegment. These cell types have been designated 1**, 2,** 3*", and 4*, from 
anterior to posterior, with hh transcription occurring in precursors of the 1*" cells (J. 
5 Heemskerk and S. DiNardo, stpra), Dififerentiation of the first three cell types was shown 
to be dependent upon hh gene function, and it has been proposed that the fiites of these 
cells are determined by the concentration of Hh protein, with highest concentrations 
producing the 1 ° fitte, intermediate concentrations producing the 2*" fate, and the lowest 
concentrations producing the 3^ &te (J. Heemskerk and S. DiNardo, siq>ra). This 

10 proposal was supported by observations that the most anterior celt types display the 
greatest sensitivity to a reduction of hh expression, and that all of the 3 " and some of the 
4'' bristles are replaced by naked cuticle characteristic of the more anterior 2"* cell type 
when Ml is expressed ubiquitously at high levels. We have reproduced suppression 3 and 
some 4° &tes by heat shock induction of embryos that carry our wild-type construct 

15 (FIGURE 6E), but find that the H329A mutant is unable to alt^ cell fates m the dorsal 
cuticle of the larva (FIGURE 6F). Auto-proteolysts, or perhaps some other fiinction 
blocked by the H329A mutation, thus appears to be essential for the patteming influence 
of Hh protein upon the dorsal cuticle. 



EXAMPLES 

20 EFFECTS OF THE H329A MUTATION 

WQJi gyCNA MNG IW IMAQINAL PK C S 

Studies of H329A mutant protein were extended to the fimction to the patterning of adult 
structures and sij^ialing within iihagtnal discs. In the eye imaginal disc /rA' fiinction is 
required for appropriate development of pattern (J. Mohler, Genetics 12Q: 1061, 1988; 
25 J. J. Lee, sE(prtr, and I Mohler and K. Vani, supra) and more recently has been shown to 
control progression f a wave of differentiation via induction of deoapentcpkgic (f^p) 
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gene expression in the moiphogenetic furrow of the eye (U. Hd)erlcin, et al. Cell 2^: 
913, 1993; and C. Ma, ei al. Cell 2S: 927, 1993). In leg and wing discs, ectopic 
expression of /i/i has also been shown to yidd pattern duplications and defects and is 
associated with induction of ectopic expression of other signaling molecules normally 
5 expressed in a zone along the anterior/posterior compartment boundary (T. Tabata and 
T. B. Korabeig, Ce// 76: 89, 1994; and K. Basler and G. Struhl, Nature 268: 208, 1994), 

For studies of signaling in imaginal discs, a thermal cycler was utilized to subject larvae 
carrying hedt shock-inducible hh constructs to successive rounds of heat shock and 
recoveiy. The effects of tenqsmture cycling upon expression of d^p and ^A^g in imaginal 

10 discs was examined by monitoring fi-galactosidase expres^on from a reporter gene 
carrying 4p promoter sequences or from an enhancer detector P dement inserted in the 
wg gene. In FIGURE 7, X-gal staining was used to follow expression of y^g FIGURE 7 
(A-C) or (^p FIGURE 7 (D-L) in imaginal discs of late third-instar larvae that carry 
wg^Zotd^lacZtepontx gmes. Leg (A-F), wing (G-I) and eye-antennal discs (J-L) 

15 from control larvae (A. D, G, J), larvae canrying the hshh transgene (B, E, H, K) and 
larvae carrying the hsA/r H329A transgene (C, F, I, L) are displayed. In all pands anterior 
is to the left. Arrows highlight the following features: an ectopic patch of ^p expression 
in the anterior compartment of wing discs in hsM H329A larvae (I); and an ectopic band 
of expression in eye portion of the eye*antennal disc anterior to the morphograetic 

20 fiimow (marked by the other band of (^p cxptesskon more posteriorly) in hshh larvae (K). 
Expansion into the anterior compartment of wg repression in leg discs, and i^p 
expresson in leg and wing discs in hshh larvae is similar to that described for the ectopic 
expression of hh. Morphological changes in the anterior compartment of leg (B and E) 
and wing discs (H) were also as described (K. Basler and G. Struhl, supra). In contrast, 

25 discs from hshh H329A and control larvae showed very little change in n^g and 

« expresfflon, even undo* prolot^ged heat shock conditions and morphological changes^ were 
never observed. (M-O) The eye phenotypes of aduh control (M), hsWr (N) and hsft/r 
H329A (O) flies duit were shocked during larval development in a manner simihir to that 
of the imagjnal disc experimmts above. Duplicated eye structures were observed in hshh 
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flies, but never in hshh H329A flies. The arrow in (N) points to a thin strip of cuticle 
between the two eye structures. Other deformities were also seen in hshh flies (fior 
example, compare the thorax in N to M) . 

Virgin female flies fix)m the homozygous lines hshh (D. T. Chang et al., Development, 
s 1 994, in press), hshh H329A» and / w"^* were crossed to males fix>m the homozygous 
BS3 .0 fine (beariqg a P element i^p reporter construct on the 2nd chromosome, referred 
to as 4>p4acZ) (R. K. Blackman, et al. , Development JJl: 657, 1 991) or the line Sco/ 
CyO, enlacZJL rwg (beariiig a wg reporter P element enhancer trap on a second 
chromosome balancer, called wg-lacZ) (J. A. Kassis, et al., Proc, Natl Aaxd. Set U.S.A. 

10 89: 1919, 1992). Progeny were grown at 25° C in aerated O.S-ml microcentrifuge tubes 
containing yeast paste until the late second instar or early third instar stage of lan^ 
development. The larvae were then cycled continuously at 3V C for 30 nunutes followed 
by 25"* C for 90 nunutes b a Perkin-Elm^ thermal cycler until th^ readied the late third 
instar stage. They were subsequently dissected and stained with X*gal as described (M. 

15 Ashbumer, siqyra) or allowed to grow to adulthood for phenotypic analysis. 

As shown in FIGURE TA, wg e}q>ression normally occurs in a ventral sector of the leg 
disc along the anterior/postmor con4>artment t>oundary vMe i^p is ^r^sed in the 
doi^al portion of the disc along this boundary (FIGURE 7D). Although thermal cycling 
" of larvae earring the wild-type hh gene produced abnormal leg disc moiphology and 

20 extensive ectopic e)q)res^on of both target genes, as previously reported for ectopic hh 
expression (FIGURE 7B and E), the H329A construct produced little if any delectable 
differ^ce in these patterns of expression (FIGURE 7, C and F). Ectopic hh e)q>ression 
in the wing disc also leads to morphological dumges and expanded expres^on of f^/? 
(compare FIGURE 7, G and H), but the H329A construct produced oidy an bipcasioiuil 

25 small patdi of anterior ectopic expression (FIGURE 71). 

Ubiquitous expression of wild-type hh also leads to ectopic expression of (^p in the 
eye-antemial disc (compare FIGfURE 7, land K). In the antennal portion of this disc the 
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expansion of dpp expresaon resembles that observed in leg discs. In the eye portion of 
the disc eiqiresaon is observed at its normal location in the fiirrow; howev^, ectopic 
expression also occurs in the form of a second dorso-ventral band at a location somewhat 
anterior to the fiinow, thus giving the iq>pearance of an eye disc with two morphogenetic 

5 furrows (FIGURE 7K). Indeed, in adults dmved from temperature-cyded larvae that 
cany the wild-type hh constmct, an apparently duplicated eye structure such as that in 
FIGURE 7N can be observed, with two eye structures separated by a thin strip of cuticle 
(arrow). The H329A mutant protein, in contrast, did not induce expansion of 
e?q>ression in either poition of the eye-antennal disc (FIGURE TL), and does not induce 

10 eye duplications or cuticle defects in the adult (FIGURE 70). 

The experiments described thus &r comprise multiple series of hrvae subjected to two 
days of thermal cyding followed by immediate dissection for analysis of imaginal 
structures or further incubation at constant temperature for analysis of adult structures. 
Although the H329A protein iq[>peared to have little activity in these experiments, the 

15 small patch of ectopic dpp expression induced in the wing disc (FIGURE 71, arrow) 
miggested that some residual activity rsnained. This suggestion was borne out in a similar 
experiment invotviiig three days of cycUqg prior to dissection: the ED29A protein clearly 
displayed some l^/^-inducing activity in this expmment presumably as a result of the 
higher amounts of protein that accumulated during the longer cycling period. The wing 

20 in particular, but also other imaginal discs, displayed low and variable amounts of ectopic 
dpp e?q>resdon. This e7q>res^on in all cases was far less extensive than that observed for 
the wild-type construct examined in parallel; fiarthermore, morphological deformations 
of the imaginal discs, although quite common with the wild-type protein, were e^emely 
rare with the H329A protein. Although its potency is greatly reduced relative to 

25 wild-type, the H329A protein retained at least some activity in early mibryonic and 
imaginal disc mduction of wg and d^p expression; in contrast, even under heat shock 
conditions far more severe than those required for effects by the wild-type protdn; the 
H329A mutant remained completdy inert with respect to the re-specification of dell fiites 
in the dorsal cuticle of the larva. • , 
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DTFFERENTIA I. OF N AND C 

INTO CULTURED CELL SUPERNATANTS 

A puzzling feature ofhh function is its apparent short-range action in settings such as 
embryonic and imagjnal disc signaling to wg and i^p, and longer-range action in other 
settings, such as patterning of the dorsal larval cuticle. These obs^ations and the 
existence of two nugor protein products in vivo prompted us to look for differmdes in the 
sohibility or diffiisibility of N and C expressed in S2 cultured cells. FIGURES 8 (A) and 
(B) are immunoblots of cell pellets (lane 1) or supematants (lane 2) from transfected S2 
cell cultures expressing Hh protein, developed with Abl (A) and Ab2 (B). Samples in 
each lane were from the same vohmie of resuspended total culture. Whereas N remained 
mostly associated with the cell pellet (compare lanes 1 and 2 in A), C was neariy 
quantitatively released into the supernatant (compare lanes land 2 in B). U displayed 
partitioning propmies in between those of N and C (A and B). (8C) demonstrates the 
heparin binding activity of various Hh protein species generated by m vitro translations 
with microsomes. Samples were: total translation mfac (lane I); supmiatant after 
incubation with heparin agarose or i^ose (control) beads (lanes 2 and 4); and material 
eluted from heparin agarose or agaro^ beads after washing Oanes 3 and S). F, U, Nss and 
N fragments are depleted ftpm reactions incubated with heparin agarose but not agarose 
beads (conq>are lanes 2 and 4 to 1), and the same species subsequratly can be eluted from 
the heparin agarose but not the agarose beads (compare lanes 3 and 5 with Isme 1). 
FIGURES 8, A and B indeed show that these proteins behave diflferratly, with most of 
the N fragment remaining cell-associated and all, or nearly all, of C bemg released into 
the culture supernatant. 

One possible expiianation for this diflferential behavior might be association of the N 
fragment with extracdlular matrix proteins on the surfiu:es of the S2 cdls. Accordingly, 
the rdative affinity of these two protems for heparin agarose was examined, since heparin 
binding is a common property of proteins that associate with the extracellular matrix. 
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Given the obvious dtfiSculty in obtaining soluble N from cultured cells, in vitro, 
translation in the presence of microsomes was used to generate soluble, labelled N and 
C. As shown in HGURE 8C, N but not C is depleted from these translation extracts by 
treatment with heparin agarose beads, wUle treatment with unmodified agarose beads did 
not deplete either fragmoit. Furthermore, N but not C was retained upon the heparin 
agarose beads upon extensive washiitg with a solution that contains 0. 1% Triton X-100 
and 150 mM NaCl; in contrast, ndtho- fragment was retained by unmodified agarose. N, 
but not C, binds ti^ly to heparin, and this bdmvior suggests that the low concenttation 
of N released into culture supematants may be the resuk of binding to the extracellular 
matrix. Another mechanism that might contribute to the differential release of N and C 
into culture supernatant would be the expression in S2 cells of a receptor for N but not 
for C. Our current data can not distinguish these possibilities. 
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EXAMFLE7 

DISTINCT EMBRYONIC LOCALIZATIONS OF N AND r 

The dififermtial rdease of N and C into cultured cell supematants suggested the 
possibility that these fragments might also be differentially localized in mibryos. 
Previously reported hh protein localizations utilized either antibodies specific for N 
epitopes or antibodies unable to distinguish b'etween N and C FIGURE 9 shows the 
dififerential localizations of N and C in embryos by in atu localization of the hh transcript. 
FIGURE 9 (A) is shown in comparison to the distribution of N and C epitopes detected 
with Abl and Ab2 in panels (9B) and (9C), respectively. Note that the distribution of N 
and C epitopes span approximately one-third and one-half of each segmental unit 
respectively, while the transoipt is limited to approximately one-quarter of each unit. In 
(9D), the localization of C epitopes in embryos homozygous for the hh'^^ allele is detected 
with the use of Ab2. C epitopes in this mutant, which displqrs impaired auto-proteolytic 
activity are more restricted, and resemble the wild-type localization of N. Homozygous 
embryos were identified by loss of a marked balancer firom a heterozygous parent 
stock. All embryos are at mid to late stage 9 (extended germ-band), 

FIGURE 9B shows m accordance with these reports, Abl, which is specific for N 
epitopes, reveals a s^entally localized distribution that is slightly broader than that of 
the hh transcript at the same stage (FIGURE 9A). Also consistent with these reports, we 
observed that N epitopes at lata* stages accumulate in large punctate structures. Our 
analysts here concentrates on the earlier stage, when antibody staining is weaker but 
befi^re formation of the invaginations and grooves that later crease the epidermis and 
thereby complicate the interpretation. Ab2 was also utilized to detect C-specific epitopes 
with a variety of fixation and staining procelduresT Although detection of C epitopes above 
background is more diflScult than for N, we consistentiy observed a segmentally 
modulated pattern, albeit witii a broader distribution than N (FIGURE 9C) lliis 
localization is also distinctive in that C epitopes at early or late stages are not found in thb 
punctate structures characteristic of N. - 
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The hhf^^ mutation encodes a prematurely truncated protein that is missing IS residues 
normally present at the COOH-termmus. Because this protein displqrs a much reduced 
efficiency in auto-proteolysis the distribution of C in this mutant background was 
exanuned. FIGURE 9D shows that C epitopes in a homozygous hh^^^ embiyo (identified 
by absence of a marked balancer) are distributed in a much tighter segmental pattern than 
in wild-type. This locaii2ation resembles that of N, and we thus conclude that the broad 
distribution of C epitopes normally seen is altered in by retention of the uncleaved 
precursor near the ^te of synthesis. 

EXAMPLES 

THE ROLE O F AUTO-PROTEOLYSIS IN BIOGENESIS 

QF AcnvE Bgpcl;^oG protcw 

In addition to agt^ cleavage, the hh protein undergoes autai>roteolyffls at an internal site 
to gen^ate the predominant protein species observed in vivo. All or most of the amino 
acid residues required for this auto-proteolysis function map to C, the caiboxy-terminal 
product of this internal cleavage. In an effort to determine the inqiortahce' of 
auto-proteolysis for fimction, we introduced a single residue mutation ^29A) that 
blocks auto-proteolysis of the Mr protdn in vitro and dononstrated that both processing 
and function of this protrin is inqiaired in vivo. Since similar levels of induced protein 
were detected from a strain carrying the wild-type construct or fit>m several strains 
carrying independent insertions of the mutant construct (FIGURE S), the impaired 
fimction of the H329A protein relative to wild-type is not the result of reduced levels of 
expression. Further evidence in support of a role for auto-proteolysis derives from the 
eflfect of the /i/r"^ mutation, which reduces but does not eliminate auto-proteolysis of the 
hh protein in vitro (FIGURE 4). Correspondingly, the M'^ mutation is associated with 
a phenotype of intermediate strength in vivo (J. Mohler, supra). 

Curiously, the H329A /f/r protdn {^ipears t retain weak activity in embryonic signaling 
to induce ectopic wg expression and, to a lesser degree, can function in imaginal disc 
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signaling for induction of ectopic wg and <^p expression. In contrast to its retention of 
at least some signaling fiinctions in embryonic and imaginal tissues, the H329 protein is 
con:5)letely inert whm assayed for the ability to rcprogram cell fates in the dorsal cuticle 
of the larva. 

5 The assays in which the H329A protein is active or partially active involve short-range 
signaling that normally occurs across one or at most several cell diameters; in contrast, 
the H329A protdn fails to exert any effect upon patterning of the dorsal cuticl^, a 
long-range activity that normally operates across most of the s^ment. Previous 
proposals to account for long-range patterning activities have suggested that hh 

to expression induces other signaling molecules which are then responsible for mecuting 
the patterning functions (the signal relay model; see FIGURE lOA). FIGURE 10 shows 
a signal rday versus dual function models for hh protdn action. In FIGURE 10 (A), the 
long-range effects of hh signaling are achieved indirectly through short-range induction 
of a second signaling molecule (X), Based on its biochcanical properties and its restricted 

IS tissue localization, N is preamied to represent the active short-range signal 

of C would be limited to supplying the catalytic machinery required for biogenesis of N. 
In (lOBX the long- and short-range signaling fimctions of /i^ are supplied by the N and 
C proteins derived by internal auto-proteolysis of the U precursor N is implicated in 
short-range signaling by retention near its cellular site of synthesis, while C is less 

20 restricted in its distribution and would execute long-range ^gnaling fimctions. In both 
models, auto-proteolysis is required to generate fully active signaling protdns. See text 
for fiirther discussion. 

These proposals seek to maintain a consistent mode of het^hog action by rationalizing 
the apparent long-nmge activities of hh products as indireigt consequences of short-range 
25 signaUng. Based on the distribution observed, the active molecule in this model mig 
be N and the role of C would then be limited to supplying the catalytic machmery 
required for biogenesis of N. • 
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Our evidence suggests an alternative model, the dual function modd (FIGURE 1 OB), in 
which long- and short-range activities of the hh protein might be executed by N and C, 
the two predominant fbrms of the molecule observed in vivo. The nearly quantitative 
release of C fragmmt into the culture medium of A/r-expressing S2 cells and its broad, 
though s^;mentaliy modulated distribution within embryos suggests that C might execute 
or contribute to long-range signaling functions. The N fragment, on the other hand, 
predominantly remains associated with the aqpressing S2 cells and also binds to heparin, 
which suggests a possible association with the extracellular matrix. These properties and 
the segmentally restricted embryonic distribution of N are suggestive of a role in the 
execution of short-range hh signaling activities. Since the vertebrate Hh proteins we 
tested also appear to be auto-processed and also carry predicted hq)arin binding sites just 
caiboxy-tenninai to their signal sequences (H. Roelink et al ., supra), many aspects of the 
dual function model discussed here in the context of Drosophila development may also 
apply to hh protein function in vertebrate development. 

Execution of ^rt-range fimctions by N would be consistent v^th the observation that the 
H329A mutant protein has at least partial fimction in signaling for the induction of wg 
and (^jp, since this mutation does not aher residues located in the amino-terminal portion 
of the protein that normally would give rise to N. Th^ imdeaved H329A protein thus 
would cany all the residues that normally interact with a presumed receptor for N, 
although there might be some effect on the affinity of the interaction due to the presence 
of carboxy-terminal sequences, thus accounting for the decreased potency of the H329A 
protein. Alternatively, the partid function of H329A protein may derive from an 
extremely small fraction of protdn that appears to be cleaved, a very fiint band with 
identical mobility to C appears in in vitro translations with the H329A proteiii (FIGURE 
4, lane 3). Execution of long-range fimctions by C is also consistent with our 
observations because long-range signaling might require the release of the C fhigment or 
otherwise require the H329 residue for son^ fimction other than for cleavage 
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When N is synthesized fronn a native construct (wild type hh), it remains primarily celU 
assodated (FIGURE IOC), however, N generated finom a truncated consUuct in cultured 
cdls predominantly enters the culture medium (FIGURE lOD) (For constructs, see Porter, 
e/ aL, Nature, 224:363, 1995). These results further confinn that autoprocessing by 
5 fragment C may regulate the degree of N association with the cell suxfece and therefore 
its range of action. 

ISOLATION OF HEDGEHOG H OMOLOGUES 

The mouse and human Mr-like sequences were isolated by polymerase chain reaction 
10 (PCR) using primers degenmte for all possible coding combinations of the sequences 
underlined in FIGURE 1 of Chang, e/ a/., (Development, 120^^^ 1994). PCR amplifications 
contained firom 100 ng to 2 ng genomic DNA (depending upon the genome size of the 
species), 2 \iM of each primer, 200 dNTPs (Pharmacia), IX reaction buflFer 
(Boehringer-Mannteim) and 2,5 units Taq pofym«rase (Boehringer-Mannheim) in 50 (xl 
15 reactions. An^lification was as follows: 94 ''C 5 min, addition of Taq polymerase at 
75"C, followed by 94 *C 1 min, 52**C 1.5 minand 72*C 1 min for 30 cycles and a final 
extoision of 72''C for 5 nm All PCR products were cloned into pBluescript (Stratagene) 
prior to sequence determination. 

Mouse clones obtained in this manno^ contained 144 bases of sequence between the 
20 primer ends and were labelled with [a'^JdATP and used for high stringency screens of 
mouse cDNA libraries made fi-om whole 8.5 dpc embryonic RNA and from 14.5 dpc 
embryonic brain in the XZAP vector (a gift from A, Lamdian). Several clones 
conesponding to /ffirg-1 were isolated and the tergest, 2629 bp in length (pDTC8,0)» was 
chosen for sequence analysis using dideo?^ chain termination (Sanger, et al, 1977) and 
2S Sequenase v2.0 (US Biochemicals). Conqiressions were resolved by using 7-deaza- 
guanosine (US Biochenucals). Sequence analy^s made use of the Genewofks 2.0 
(IntdliOenetics) and MacVector 3.5 (IBI) software packages. 
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One of the three mouse clones, B?fg-l, when used as a probe, yielded a 2.0 kb clone from 
a 8.5 dpc mouse embryonic cDNA libraiy and a 2.7 kb clone from a 14.5 dpc embryonic 
cDNA library. The 2.7 kb cDNA appears to tepr^ent a nearly full length mRNA because 
it corresponds to a 2.7 kb band detected by hybridization on a Northern blot. The largest 
5 methionine^nitiated open readiiig frame within this cDNA encompasses 437 codons, and 
is preceded by one in frame up^eam stop codon. Sequence comparisons indicate that 
the protein encoded by Hhg-l is identical to the indepffldently charac^mzed rnouse Shh 
O^hdaid, etal.. Cell, 75: 1417-1430, 1993) except for an arginine to lysine SSetmcp at , 
residue 122. Hhg-l also corresponds closely to the rat vhh-l gene (97% amino acid 

10 identity; Roelink, et al.. Cell, 2^:761-775, 1994), the chicken Sonic hedgehog (81% 
identity; Riddle, et al.. Cell, 25:1401-1416, 1993) and S/r/r from the zebxafish (68% 
identity; Krauss, et al. Cell, 25:1431-1444, 1993; Roelink, et al. Cell, 26:761-775. 
1994). The PCR-generated fragments Hhg-l and /Ag-3 appear to correspond to the 
Indian and Desert classes of mouse hedgehog genes, respectively (Echdard, et cd, , Cell, 

15 25:1417-1430, 1993). 

Alignmoit of the Hh%A open reading frame with the two Drosopkila hh sequences 
, showed that all three proteins contam hydrophobic amino acid sequences near their 
amtno-termini; the hydrophobic stretches within the D. melanogaster protdn (residues 
64 to 83) and within the mouse protein are known to act efficiently as signal sequences 
20 for deavage (Lee, et id. Cell, 21:33-50, 1992). Both Drosoj^la signal sequmces are 
unusual in their internal locations, while the hydrophobic stretch of the mouse gme 
occurs at the extreme amino-terminus, a more conventional location for cleaved signal 
sequmoes. Ahhoii^ portions of sequence N-teraiinal to the Drosophila signal sequences 
are conserved, suggesting a fimctional role, the mouse gene lacks this region. 

25 The overall level of amino add identity between HhgA and M caiboxy-tenninal to the 
signal sequences is 46%. A dos^ examination shows that the amino terminal portion, 
from residues 25 to 187, disphiys 69% identity, while remaining residues in the carboxy- 
teraiinal portion display a much bwa* 31% id^ty. Like Mt, the /ffiig-1 coding sequence 
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is divided into three exons, and the boundaries of these exons are at the same positions 
within coding sequence as those of the three Drosophila hh exons. Curiously, the 
boundary between coding sequences of the second and third exons occurs near the 
transition from high to low levels of overall sequence conservation. The coincidence of 
s these two boundaries ^ggests a posdble demarcation of functional domains within these 
proteins. This location within HhgA codmg sequence also coincides approximately with 
the site of a presumed prbieolytic cleavi^e. 

S VMAW C L QWIP^G OF h h ggNE$ 

10 Partial sequmce for two human hh genes has been obtained by DNA sequencing of 
clones derived by PCR an^lification from genomic DNA with /r/r-specific degenerate 
primers as outlined in Chang, et al,, (Development, 12Q:3339, 1994) and EXAMPLE 9 
(FIGURE 1 1 A and B). More extenmve screening by the same approach, dther with the 
same primers or with other primers from the hh coding region or with the human hh 

15 fragments seen in FIGURES 1 1 A and B, is expected to yield at the least a third gene, and 
possibly more, since at least three genes are found in the mouse. These ^gments of 
human hh genes can be used to obtain &0 coding sequences for human proteins by the 
following cloning method commonly used by those of skill in the art and which are 
extensively described in the literature. 

20 For example, ready-made cDNA libraries or RN As from a variety of human sources, 
including various fetal stages and organs (from abortuses) and specdfic infant or adult 
oigans (from pathological or autopsy spechnens), are being tested for the presence of hh 
sequences by PCR or RT-PCR using the primers described in Chang, ei al., supra, and 
other primers derived directly from the ^uence of the human fragments. Ready-made 

25 libraries containing hh sequences are being screened directly and, where necessary, new 
libraries are being constructed by standard methods from RNA sources contaihirig A/r 
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sequences. The probe for these screens is a mixture of all the distinct human hh 
fiagments. Sequences of cDNA clones can then be determined. Most clones containing 
the probe sequences, v/bich are located in the N region, vvill also include a fiill C coding 
region since standard methods of Ubraty construction result in cDNA clones that are most 
s complete at their 3' ends. All fiiU length Aft-codmg sequences obtained previously in 
vertebrates and invertebrates contain N and C sequences encoded in a single RNA. 
Screening is continued until complete open reading fiames that correspond to all of the 
fiagraents of human hh genes are obtained . Specifically, 1.2x10* clones from a humdn' 
fetal brain library (Stratagene. La JoUa, CA) was screened using a nuxture of the two 
10 human hh fragments (FIGURE 1 1 A and B) as probes. Twenty-nine clones were 
identified as specifically hybridizing with these probes. 

Second, the RNA sources identified as containing hh sequences can be used as templates 
from anchored PGR (also referred to in the literature as RACE, for rapid amplification 
of cDNA ends). Briefly, this method provides a means to isolate fiirther mRNA sequence 
15 in either the S* or 3* direction provided that sequence is known fiom an internal starting 
point. Anchored PCR can also be used to isolate sequences from cDNA library 

Thinl, genomic fibraries can be screened with the probes descnl)e^ 
Where necessary, human hh exons and coding sequences are being identified by 
hybridization to previously isolated human and mouse codipg sequences by sequcaice 
20 determination, and by exon-trapping methods to identify all hh coding sequences v^thin 
genomic clones; these coding sequences can be "stitcdied" togedier by standard 
recombinant DNA m^ods to generate complete M opm reading fiames. 

FIGURE 12 A and B show in vitro cleavage reactions of b Drosophila hh protein 
produced in K coli and purified to homogeneity. This protdn has residues 89-254 
2S ddeted, rendering it more soluble and easier to purify. It also contains a His^ purification 
tag appended to the N-terminus. Autoproteolysis of this protein is triggered by the 
addition of reducing agents (DTT), and the resulting product corresponds to the C 



96/17924 



PCT/DS9a/1940 



^8- 

fragment identified in vivo. FIGURE 12, Panel A shows a time course of cleavage after 
initiation by addition of DTT. Panel B shows incubations of concentrations ranging over 
three order of magnitude for a fixed time period (four hours), with no difiPerence in the 
extent of conversion to the cleaved form. This concentration-indepinident rate of 
cleavage indicates an intramolecular mechanism of cleavage. Panel C shows the 
sequence around the cleavage ^te as determined by amino-terminal sequence of the 
cleaved fragment C. The cleavage sit£.is denoted by the wrow, and the actual residues 
sequenced by Edman degradation of the C fi:agment are underlined panel C al so shows 
an alignmmt of all published vertebrate hh sequences plus some of unpubliished 
sequences from fish and Xenopus. The sequences shown correspond to the region of 
Drosophilahh Vi/hett the cleavage occurs, and demonstrates the absolute conservation of 
the Gly-Cys-Phe sequence at the site of cleavage. Pand D shows a SDS-PAGE gel 
loaded with in vitro transcription/translation reactions as described in the previous 
Examples, using various hh genes as tenq)lates. dhh is Drosophila. twhh and zfshh Bxc 
tiie twiggy-winUe and swfic hh genes of tte zebrafish, and txvshh is the shh/Hgh-l/vhh-I 
gene of the mouse. The transhition mfac included ^^S-labelled cysteine, used to visualize 
the resulting products by autoradiogrqifay. Note that each gene give a larger product (the 
precursor or U) and two smaller products of cleavage (N and C). The larger species is 
C for eadi of the vertebrate genes, y^o-eas the DrosophilaVl is larger than C due to the 
prraence of -60 residues occurring amino-terminal to the signal sequence that ^re pr^irit 
in the vertebrate open reading finame. This panel shows that vertebrate hh proteins are 
processed similarly to the Drosophila protdn. Panel E ^ows that Edmah degradation 
of the C fiiagments releases "S counts on the first but not subsequent rounds for all these 
proteins, indicating that the dte of autoprotcolytic cleavi^ for all of these hh proteins 
is the amide bond to the amino-terminal side of the Cys residue that forms the center of 
the conserved Qy-Cys-Phc sequmce highlighted in panel C This is a generalizable 
approach to establish the composition of protein firagments fiiom any otter M fiunily 
membCTS. 
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EXAMPLEll 

DIFFERENTIAL EXPRESSIO N OF TWO hk GENES IN AXIAL 
MESODERM AND IN NEURAL FROGEWITQ RS. 

Partial sequences oorresponding to five distinct zd>rafish Mr-like genes were isolated and 
s tiie complete coding sequences for two of these genes were obtuned fiom an embryonic 
cDNA Ubiary. One of these two sequences islienticai to that of the zdwafish wM-/ gene 
(Rodink, era£, C«/7,76:761, 1994), and appears to correspond to the sWr gen© rqwrted 
by Krauss, etaL, (Cell, 25:1431, 1993) (See FIGURE 13 description); the other gene, 
tiggy^inkle (Potter, B., The Tale of Mrs. Tiggy-Winkle, Jhe Penguin Group, London, 
10 1905), reprc8«its a novd votebrate hh. Coding sequences for both are shown in 
alignment to mouse and chidcen sequences of the sonic/vMi-l dass (FIGURE 13b). Like 
other vertebrate hh homologues, the twhh and shh protdns contain an amino-terminal 
stretch of hydrophobic residues! These readues ftmction as signal sequences since 
cleavage is observed when coditig sequences are translated in the presence of micro- 
is somoses; vertebrate hh genes thus appear to encode secreted proteins, as previously 
reported for Drosophila hh (Kimmd C.B. & Warga, R.M., Developmental Biology, 
124:269-280, 1987; Warge, R.M., & Kimmd. C.B., Development, i(JS:569-580, 1990). 

The first four sequences were isohited fitim zdirafish gaiomic PNA (a gift fit>m J. 
Pdlc^rino) using degenerate immers in polymerase chain reactions as desoibed (Chang, 
20 etaL, sqjru). twhh and Mi clones wwe isoUited from a 20-28 hour dONA Itorary (a gift 
fiom R. Riggleman, K. Hdde, D. Giunwald and J. PeUegrino) using the first three 
sequences as probes. The transhitional reading fiwnes for twhh and Mt were closed 12 
and 16 codons, re^ectively, upstream of the putative initiating niethionine. 

Figure 13 shows thcf^redicted aimno acid sequences are shown in single letter code. 
25 13(a) shows sequences comihon t five distinct Afc-like genes are shown with a dot 
indicating identity with the corresponding residue of zebrafish twiggy-winkle (twhh. 
Potter 1905; supra), hh[zffl) and hh[zfC] is more diverged and appears to reprint a 
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novel class. 13(b) shows amino add sequences of twhh and shh are aligned to those of 
the soniclvhh'l class from chick and mouse (Riddle, et cJ., Cell, 7§: 1401-1416, 1993; 
Chang, D.T., et al., Development, stq>ra\ Echelard, Y., et al. Cell, 25: 143 1-1444, 1993). 
Zebrafish sonic hedgelwg (Mi) is identical in sequence to z-vhh-I reported by RoeUnk, 
5 et al. Cell, 76761-775, 1994. Based on expression and extensive sequence identity 
throughout most of the coding region, vhh-1 and the sonic sequence reported here 
probably correspond to of Krauss, et al, Ce//, 2i: 143 1-1444, 1993, diverges 
dramatically throughout a 26 residue stretch near the carboTQr-teraiinus. Rat vl^-t/f^nic ^ 
hh (Roelink, et al., aam ) ^vas exchided in this alignment because of its 97% sequence 

10 identity to the predicted mouse protein. Residues idaitical in all four sequences are 
boxed, and a dash indicates a gap in the alignment. The arrow indicates the predicted 
signal sequence deava^ site (von Hc^e, G. , Nucleic Acids Res., 4683-4690, 1986) 
for twhh. The amino-terminal hydrophobic stretch common to all four hh genes is 
shaded. 13(c) shows percent identity of residues caiboxy-terminal to the hydrophobic 

15 regioa 

Figure 14 shows a comparative e}q>ression of twhh, shh, and paX'2 during zebrafish 
embryogene^. Whole moimt in situ hybridizations on 0-36 hour embryos were 
p^ormed using a modification of the procedure of Tautz and PG^e, Chronosoma, 
2S:81-85, 1989, with antisense probes. Transcript localization is i^eyealed by the purple 

20 product of an alkaline pho^hatase enzymatic reaction. Staging of the embryos is 
according to Westeifield, M., (The Zebrafish Book, University of Oregon Press, Eugene, 
1993). Transcrqits were visualized by i;?5r/iilQi)ridization to whole embr^^ 
e}q)res5i(m in a ^^gie late shield stage embryo, (a) Dorsal view, animal pole is to the top 
The triarigular shape of e94>ression is characteristic of axial mesoderm-forming cells of ^ 

25 the hypoblast (Statchel. S.E., et aL. Development, 117:1261-1274, 1993). (b) Lateral 
view: the thicker layer of cells on the left (dorsal) side of the embryo is the embryonic 
shield; the two arrows indicate the ^i^/r/r-expressing hypoblast cells and the non^ 
oqnessing cpiblast. Anterior is to the left in all subsequent embryos. Dorsal is to the top 
in all lateral views, (c, d) A single embryo at the end of gastrulation (100% epiboly) 
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with iwhh-eKpressmg cells, (d) Caudal-dorsal view. Note the wide patch of stain in the 
presumptive tailbud v/bich narrows anteriorly, (e, j) Eariy somitogenesis (11.5 hour, 3-4 
somite) embryos; optic vesicles have not begun to evaginate from the waU of the 
diencephalon. (e, h, k) Lateral views of developing brain, (f, i, 1) Dorsal views of 
developir^ brain, (e, ^ g) Localization of /H^M-expressing in a single row of cells that 
will form the flood plate. The arrowhead marks a parch of /wA:ft-expressing cells lateral 
to the tailbud. (h, i, j) Localiz^on of shh is also expressied strongly in the 
protuberance, (j) Lateral view of the developing tail, shh is also expressed stror^y- in 
the protuberance, (j) Lateral view of developmg tail shh is expressed in cells that will 
form both floor plate and notochord. (k, I m) Localization of pax'2 during early optic 
vesicle formation; (m) also shows iwhh wpression. (k) 12 hour (4-5 somites) embryo. 
(1) 12.5 hour (5-6 somites) embryo. Expression of pcBc-2 in the developing optic vesicle 
is in a gradient away fix>m the protuberance. Note the expression of pax-2 (asterisk) at 
the future midbrain-hindbrain border, (m) twhh (arrow) and pax-l e)qiression in a 6-7 
somite (13 hour) stage embryo. Note difierential expression of twhh in ventral noiral 
ked (corresponding to neural tube in other vertebrates), (n-s) Embryos at end of somito- 
genesis (22-24 hours), (n, o, p) Localization of twhh (n, o) Developing brain. Note 
isolated groups of cdls stainirig in the difficq>halon (filled triangles) and the 
protuberance (arroindiead), and floor plate ^ression underlying Ae midbrain and 
hindbrain. The floor plate e)qjression is contiguous caudaily along the axis, (d) Lateral 
view, (o) Dorsal view, (p) Lat^ view of tail. E)q)resaon is restricted to the floor plate, 
(q. r, s) Localization of shh, (q, r) Developing brain (q) Lateral view. pox-J? expression 
in the otic vesicle is indicated, (r) Dorsal view, depression in the protuberance 
(aiToi^iiead) and in flie neural ked. (s) Lateral view of tail Expression is strongest in the 
floor plate, but contrary to 4e report of Krauss, et cd,^ supra ., is ^1 also in the notochord. 
Abbreviations: white e - epiblast; h - hypoblast; tb - tailbud; p - protuberance; c - eye; 
ov - optic vesicle; ot - otic vesicle; - floor plate; nc - notochord; asterisk - midbrain- 
hindbrain boundary or /Kcc-2-labeled prospective midbrain-hindbram boundary; t - 
telencephalon. 
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Conq)anson of twhh and shh expres^otk patterns (Krauss, el al. supra), reveals that both 
gene are predpnunantly expressed in midline structures, albdt with notable differences 
in regard to tinupg, rostra-caudal extent, and tissue restriction. Expression oltwhh is first 
detected during gastiulation in the dorsal nacsoderm (FIGURE I4a» b); this e)q>re^ioh 
occurs in a band corre^nding to a subset of the enri>iyonic shield, a structure, analogous 
to Spemann's organizer in Xenopus (Stadiel. et al. Dev., 112:1261-1274, 1993, and 
refarenoe therein; Ho . IL, Sfiwinosyjw De^lopmental Biology, pg.3, 1992) In concert 
with the movements of convergence and extension, this band of twhh expression shortens 
along the equatorial plane and extends along the incipient embryonic a^ds until, the 
end of gastrulation, expresaon occurs throughout the entire axis (FIGURE 14c,d). Early 
in sonutogene^ twhh RNA is found restricted to presumptive ventral neural tissue along 
the entire body (FIGURE 14e, f, g), the only exception being cells in and near the tsdlbud 
(FIGURE 14g). In contrast to the neural restriction of twhh, shh is localized both to 
presumptive neural and notochordal cells (FIGURE 14j). 

As sonutpgeneas proceeds, ventral nddline expression of shh and twhh is reduced in most 
of the prospective forebrun, but remains strong in an anterior patch of nudline cells 
within Ae floor of the prospective diencephalon (FIGURES 14c, f, for twhh, FIGIJRES 
h, i for shh). This patch later will give rise to the protuberance (Schmitt, E.A. and 
DowUng, J.D., J. Comp. Neur., 344:532-542, 1994). an anterior extension of" the 
diencephalon. tUs structure, which is medial and just rostrid to tiie developing optic 
stalks, is the ^te we propose as the focus of early patterning activity for the developing 
eyes (see bdow). By the end of soimtogene^s, both twhh md shh are strongly e)q>ressed 
in the floor plate (FIGURES 14p, s), although shh transcripts remain detectable in the 
notochord at this sts^e and at 36 hours of developmmt (FIGURES 14s; later stage not 
dK>wn). At 28 hours, twhh transcripts are also found in a small cluster of cells within tiie 
first gill arch (not ^own), as also reported for shh at 33 hours of developnient (Krauss, 
eial., siq^ay 
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Difiermces between twhh and sA/r e?cpression are ^>parent from the beginning of 
gastrulation, since iM'/iARNA can be detected as early as the shield stage vMeshh is first 
detected later, at about 60% epiboly (not ^own; (Kiauss, et al., supra). In addition, twhh 
transcripts are restricted to neural tissues early in development, and are never detected in 

5 the notochord (compare FIGURE 14g to FIGURE 14j). Later differences in e7q)ression 
include differential rostra-caudal restriction within the dsencephalon and midbrain and 
weaker and more restricted ^ression of twhh in the protuberance (compare ||pURES 
14n and 14q), such that the later domain of twhh expression in the brain ^pear^ to 
constitute a subset of the domain. In addition, sAA but not m^/iA is e9q)ressed in the 

10 developing fin bud (Krauss, et al,, supra). Comparison of shh and twhh expression 
patterns to this previoudy reported for hh homologues in zebrafish and other vertebrate 
species indicates that shh is the zebrafish homologue of the sonicA^hh-I class wlule twhh 
rq)resents a novel class of vertebrate /eA. 

15 DEVELOPMENTAL CONSE QUENCES OF ECTOPIC hh 

EXPRESSION DURING ZEBRAFISH EMB RYOGENESIS 

To gain insi^ into the potential roles of hh products in development, synthetic twhh and 
sM mRNA was injected into 1-8 cell embryos. This technique yields a mosaic but fiurly 
unifonn pattern of expression, as detemuned for the control mRNA encoding p- 

20 galactostdase (not shown). Unifi>rmity of e^q^ression is in good agreonent with fiite 
mapping studies of the eaiiy zebrafish embryo (Kimmd & Warga, supra; Warga & 
Kinomel, supra; Heide, et al.. Science, 265:517-520, 1994), which indicate that 
blastomeres undergo octensive cell mixing during the cleavages prior to gastflilation. We 
note that mosaidsm of e7q>resfflon caused surprisingly little variation in the phenotypes 

25 of the M iiyected embryos, pos^bly due to secretion of Mr gene products. v 

Embryos injected with synthetic twhh or shh mRNA (/r/r RNA) exMbited numefOiis yet 
highly reproducible abnormalities in comparison to control embryos iigected with iorcZ 
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tnRNA. These abnormalities, disoissed below, are primarily defects in the brain and 
eyes. Although the effects of ectopic twhh and shh »pression were qualitatively similar, 
the inddence ami severity were greater with AvMRNA (see text M IS and 

FIGURE 16). The proteins encoded by these two genes have qualitatively similar 
5 biological activities, but apparent diff^ences in potency. 

FIGURE IS shows th% effects of ectopic hh on zebrafish developnient. Wild type 
zebrafish, Danio /ier/o, Ekkwill Waterlife Resources) were maintained at 28.ST, some 
embryos were then cultured qyemight at RT. Zebrafish embryos were injected at the 1 -8 
cell stage with twhh, shh, or /ac2RNA and examined at 28 h of development iftrc) 

10 Dorsal view of the midbrain-hindbrain region; anterior is left, (a) lacZ. (b) twhh, (c) shh. 
(d*f) Frontal optical section of the forebrain region; anterior is up. (d) lacZ. (h) twhh, (f) 
shh (g-1) Lataral view of the eye region; anterior is left , (g) iacZ. (h) twhh. (i) twhh. At 
levels caudal to the prospective brain, the notochord, somites, and neural keel formed by 
most Afr-injected onbryos appeared grossly normal except for an overall ^ortening and 

15 dorsal curvature of the axis. A minority of /i^injected embryos (1S% are not shown) 
displayed partially bifiircated axes, containing duplicated axial mesoderm and parallel 
neural keels, each neural keel compri^g ventral midline ceUs and some bilaterally 
symmetric lat^ cells (not ^own). Althou^ we have not determined the primary cause 
of these axial defects, analysis of late gastrulation stage mibryos suggests hat the 

20 bifiircation ntiay result fix>m difficulties in epiboly and co^ Abbreviations: mv - 

mesencaphaiic vi»itricle; rv « rhombencephalic rentricle; asterisk - midbrain-hindbrain 
boundary; ot - otic vesicle; tv - third (diencephalic) ventricle; r - retina or retina-like 
structure; 1 - lens or lens^Uke structure; pe - pigmented retinal epithelium. 

Morphological defects in the brain and other rostral neural derivatives occur at high 
25 fi-equency in A/r-jnjected embryos. The three ventricles of the fish bram normally 
apparent at 28 hours of development - the rhombencephalic^ mesencephalic (FIGURE 
15a), and diencephalic (third ventricle; FIGURE 1 5d) - are not formed in the brains othh 
iiyectees (FIGURES 15b, c; FIGURES 1 5e, f)), despite the obvious presence of ?i lumen. 
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The prominent construction normally present at the midbrain-hindbrain boundary also is 
absent (compare FIGURE ISa to FIGURES ISb, c). Formation of this constriction 
requires function of/wr-^ (Krauss, et al. Nature, 353:267-270, 1991; Krauss, et al., 
Nature, 26^:87-89, 1992), which normally is expressed in a band at the midbrain- 
5 hthdbrain boundaiy (Krauss, eial, supra, Krauss, etai, Development, 113 .1 193-1206, 
1991) pax2 expresfflon at this boundary is not disrupted by hh RNA injection, however, 
indicating tfiat this phenotype does not resuh from dimiption of rostral-caudal 
information. 

Defects in eye devdopment also occur at hi^ frequency in embryos injected with M 
10 RNA. Thus, while at 28 hours the normal zebraftsh eye has a lens and a retina with 
pigmented epithelium (FIGURE ISd, g), A^injected embryos usually &il to develop 
lenses and retinal pigmentation (FIGURE ISe, h). Eye duplications are also observed at 
low frequencies (FIGURE ISi). The poorly developed eyes do not appear to result fix>m 
a ^n:q>le delqr in developmmt since pigmentation elsewhere in injected embryos appears 
15 in its normal time course. Examined at three days of development, the consequences of 
hh RNA injection include defects that range from complete absence of eyes to partially/ 
formed eyes ladrtg a ventral portion of the retina. « ^ 

The eye phenotypes caused by hh RNA injection resCTiUe diose pfoduoed by tre^tm^t 
of zebrafish and Xenopus laevis embryos with rednoic add htXenopus, phenotypes 

20 range from reduction of the eye and absence of the loids to eyes with retinal folds 
(resemblhtg duplicated dyes) and multiple small lenses (Nfatms, M. Sc Fritzsch, B., 
NeuroscL Lett,, 127:150-154, 1991). In zebrafish, exposure to retinoic acid during 
gastrulation interferes with the fi>rmation of the eye (Holder, N. &, Hill, J., Development, 
111:1 159-1 170, 1991), while exposure during formation of the optic primordia induces 

25 formation of duplicated retinas and extra loises (Hyatt, etal., Proc. Natl AcadSci. USA, 
£9:8293-8297, 1 992) Pattemir^ efifects of retinoic add upon the developir^ chide limb 
appear to be mediated through ectopic activation of the endog^ious sonic hh geno 
(Riddle, et al, supra), these results with ectopic hh ^ression suggest the possibility of 
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a similar meduuiisin underlying the patterning effects of retinoid acid treatment in the 
vertebrate 



bh EXPRESSION IN THE OPTIC VESICLE SPETIFIES 
s PROXIMAL FATES AT THE EXPENSE OF DISTAL FATES 

To further elucidate the role of A/i in eye development we utilized pax-2..aaA pax-6 
(Krauss, etal, EMBOJ., 1Q:3609-3619, 1991; Pitischel, et aL. DevehpmM, 11^643- 
651, 1992) were utilized as positional markers to examine the effects of ectopic ^/? 
expression on the optic vedcle. As the optic veacle cvaginates from the lateral walls of 
10 the zebrafish forebrain (Schmitt, EA. & Dowling. J.D., J. Camp. Neur., 244:532-542, 
1994), pax'2 is expressed in a gradient, with highest RNA levels in the anterior and 
ventral r^ons of the optic v^cle (Krauss, et al., stq)ra, FIGURE 14k, I, m). 
Immediately adjacent to the maximum of this pax*2 expr^ion gradient is the region of 

the dicneqihalon tenned the protuberance (Schmitt & Dowiing, supra), where both iwhh 

* ..." ' 

15 , and sM but not pax-2 m strong expressed (FIGURES Me, ^ h, i, m) The 

concentration gradient of /wdc-2 expression in the eptic vedcle thus appears to incline 
downward from its maoddium at a Ibcadon adjacent to the site of twhh mdshh exprcMiion 
in the protubo-ance. Superposition of dcvdopmental fete within the optic vesicle 
(Schmitt, et aL, supra), upon the pattern of pax-2 expression suggests that the gradient 
20 of pax'2 RNA prefigures the future proximal/distal axis of the eye. 

Ectopic hh ahers the e7q)r^on of paX'2, pax'6, and F-^mUn. Zebrafish embryos were 
injected at the 1 -8 cell ^e with twkh or shh RNA and the pattern of pax-2, pax-6, or F- 
spofuiin expression was examined by whole mount in situ hybridization Control 
embryos injected with lacZ RNA w«e performed in every case and displayed wiW-type 
25 expression patterns. At embryo stage, the anterior*posterior axis of the optic vesicle 
corresponds to the fiiture proximal-distal axis of the eye. During the next hour of 
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devdopment, the posterior edge of the optic vesicle will separate from the dtencephalon 
(Schmitt and DowKng, Ccwi^p. Neur., 2M:532-S42, 1994). 

Injection of either hh RNA causes uoiform initiation of pcDC-2 tT^xesmn along both the 
proximal-distal and dorsal-ventral axes of the optic vesicle as it begins to evaginate. The 
5 ectopic pcDc-2 expresaon q>pears at the same time as normal pcDC'2 expres^on is initiated 
in the eye, and in some cases, is also seen in the diencephalon behveen the optic vesicles. 
At the end of sonutogeneas, a time \^en pax-2 would normally be restricted to the optic 
^alk, pax-2 RNA m hh injected mibryos is detected in all but the most distal portion of 
the optic vesicle. 

10 The effects of ectopic hh on e>q[>ression of pcxx^, whidi encodes a transcription factor 
critical for eye development was also studied. At 22 hours of zebrafish development, 
paxS is nomudly expressed in the lens and in most of the distal part of the optic cup 
(Krauss, et al., siq»ra\ Puschel, et aL, Devehpment, 114:643-651, 1992). In M-injected 
embryos, pax-6 is repressed in the optic vesicle, although many embryos retain 

15 expression in the most distal cells. \^th rc^gard to pax'2 and pax-d as markers of 
positional identity, hh expresdon in the optic vesicle can be characterized as inducing 
proximal &tes and repressing distal fates. 

The distal part of the optic vesicle is tiie most refractory to /rft-induced changes in both 
paX'2 and pax^6 gene expression. Due to a later rotation, this distal portion of the optic 
20 vesicle will give rise to the dorsal portion of the mature eye (Schmitt, ei al, supra), 
interesting, this is the portion of the eye that remains in 3-day old iiqected embryos with 
intermediate phenotypes (see above). 

Lesions in the pax'^ gene have been assigned as the basis for the Aniridia (Ton, ei al.. 
Cell, 67:1059.1074, 1991; Glaser, etoL, Nat. Genetics, 2:232-239, 1992), Small eye (Hill, 
25 ei aL, Nature, 3M:S22-52S, 1992), and eyeless mutations (Quiring, et al. Science 
2fiS:785-789, 1994), in hunums, mice and Drosophila, respectively; pax-^ function thus 
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appears to be critically required for eye development in Drosophila and mammals. As 
we argue here, /r/i-cncoded activities also appear to play a role in vertebrate eye 
development, and this suggests a further molecular parallel between vertebrates and 
insects, since the role of M in Drosophila eye development is well established (Mohler, 

s etoL, siqmr. Ma, etal., stgmi, Hebalein, etal,, supra; Lee. etaL, supra). The reciprocal 
and non-ove]i^>ping patterns ofhh and pax-6 expression in the developing Drosophila 
eye (Ma. et aL, supra, Quiring, et al.. Science, 26^:785-789, 1994), suggest the 
possibility of pax'6 repression by hh, but whether hh fimctions by similar mechanisms 
in vertebrate and Drosf^hila eye development is a qu^ons that requires further 

10 investigation. 

In mice, the dosage ofpar-6 protein is crucial for normal eye development (Hill, et al., 
supra). Small eye hetcrozygotes develop an abnormally small lens (Hogan, et al, J. 
EmhryoL Exp. Morph, 92:95-1 10, 1986; Hogan, et aU Development, 103 Suppl, 115- 
1 19, 1988), as do Aft-irgected embryos whh weaker phenotypes (FIGURE 140. Small eye 
IS homozygotes lacking lenses eventually generate and the animals lade eyes at birth 
(Hogan, et al, supra, Hogan, et at, supni^y as do many of the A/r*mjected embryos at 
three days of development. These parallels suggest that nwny of the later eye defects 
observed in M-injected zcbrafish may be caused by partial or complete repression oipax- 
6 during eye development. 

20 EXAMPLE 14 

GENETIC ABLATION OF hk FOREBRAIN E XPRESSION CAUSES LOSS OF 
PROXIMAL FATES IN THE OPTIC VESICLE 

The patterns oftwhh and shh expression (FIGURE 14) and the effects of ectopic hh 
CTcpression (FIGURE 15) are consistent with a normal role for shh and twhh in' eye 
25 development. If Mr activities indeed play a normal role in promoting proximal fates 
within the developing eye, removal of hh activities would be expected to result in a loss 
of proximal fiates. In embryos homo^gous for the cyclops mutation ventral neural 
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structures faU to form and the developing eyes fuse at the midline,, yielding an embryo 
with a single eye (Hatta, ei aL, Nature, 252:339-341, 1991). The missing ventral 
structures in cyclops mutants include the re^ons where we observe expression ottwhh 
and sfcfe, ami we therefore examined the effects of the cyclops mutation on hh expression. 

cvc^*^ (Hatta, et al., Nature, 25fi:339-341, 1991), heterozygous adults (a kind gift of R. 
Riggleman) were spawned and their ofisprmg analyzed by w*iole mount in situ 
hybridization. Detection of pax-l and dtha^ twhh or shh RNAs in embryos homozygous 
for the eye mutation or their wUd-type siblings, twhh RNA is only expressed in the 
presumptive tailbud (caret) of eye embryos. As rc^rted by Krauss, et aL, Cell, supra, 
neural expression of is abolished in eye embiyos. Strong /Knr-2 expression was 
observed in the optic vesicles of wild-type anbryos which is significantly reduced in eye 
mutant embryos. 

twhh RNA in eychps embryos is found only in a small patch of cells at the presumptive 
tailbud and neural expression was not detected at any later stage examined. Neural 
expression of shh is also lost in eye mutants, although expression in the notochord is 
reunited (Krauss, et al., siqjra, data not shown). 

Since the eye mutation appears to ablate hh-expre^^ng cdls in the developing brain, this 
mutation can be used as a gaietic tool to examine^ Ae requirement for hh fimction in eye 
development. Batta, et a/.; Hatta, etal.. Proc. Natl. Acad Sci. USA, 21:2061-2065. 
1994), recendy daitonstrated that /wx-5 expression is fUsed at the midline due to loss of 
ventral midline cdls that normally do not express pax-6 and, in addition, pax'2 
expression in the fiised eye of eye mutant embryos is reduced. We extended these 
observations to an earlier stage v^iien the optic vesicles first form and found that pax'2 
expression is weak and fidls to extend within the vesicles in eye mutants. In conjunction 
with the results of ectopic M eqiression, these observations suggest that hh signaling that 
activity promotes and is required for the induction of proxima fiites within the eye vesicle. 
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In this model, we propose that the protuberance acts as a proxunal pattermng ceht^ for 
the developing zebrafish eye by providing a localized source of hh activity. 

hk AmVTTV Vriyra AT JZFS THE DEVEIXIPING BRAIN 

5 Previous work has established an important role of signals from the floor plate and 
notochord in ventral patterning of the nwral tube (Jessell, T.M., & Dodd, J., Cell, 69:95- 
110, 1992). For example, Goulding, etal.. Development, 111A001A016, 1993, recently 
demonstrated that notochord and floor plate grafts can rq>ress the normal lateral 
expression oipax-S in the neural tube. Other recent woric has unplicated hh activity in 
10 at least some aspects of ventral neural tube patterning (Echelard, et at.. Cell, 21' 1417- 
1430, 1993; Krauss, eioL, supra; Rodink, ei aL, supra); consequently, we examined hh- 
injected embryos for eflFects on /wx-rJ expression in the brain. 

In the zebrafish at 22 hours of development, pax-6 is expressed in dorso-lateral regions 
of the diencephalon and in a ventro-lateral donuun of the hindbrain and spinal cord that 

15 excludes the floor plate and adjacent cells (Krauss, et al., siqtra: Puschel, et ai, supra). 
This pattern of expres»on is redprocal to that of both twhh and in the diencephalon 
(compare FIGURES 14q and Hi) and in the hindbrain. hh "RHA iiqedSon caused 
repression of pax-6 in the more vmtral domain in the diencephalon, while more dorsal 
expression pa^sted. In addition, pax-6 e7q>ression was significantly reduced vmtrally 

20 in rhombomeres 1, 2, and 4 and, in some cases, was completely abolished in these 
rhombomeies. The repressing effect of ectopically expressed hh and pax'6 in normal 
embryos are due to repression of pax'6 by nearby hh expressing cdls. 

Since absmce of/ux-tf expression is a feature of the ventral midline, repression of /xzr-tf 
in lateral po^ons suggests ventralization. Consequently, twhh was injected into 
25 embryos for analysis of induction of a floor plate marker, F-^ndin (Riddle, et al, 
supra). As described above, ectopic twhh induces F-^ndirt expression at more dorsal 
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levcls in the midbrain and anterior hindbrain. The eflfects of hh upon e)q>r^on of both 
pax-6 and Fspwidin indicate a ventralization of the brain. Adoption of ventral cell 
identity lateral cdls might explain thdr fiulure to form ventricles (FIGURE 1 Sa-f). 

The ventralizing activities of twHh confirm and extend those previously reported for 
sMtAMi-J chus genes of diicken. zdirafish, and rat (Echelard, et aL, supra, Krauss. et 
al, supra, Rodink et al., sierra). The earfy restriction of twhh to midline nairal 
progenitors, however, suggests that it may play a spedfic role in the homeogentic 
mechanisms of floor phite nuinteoanoe and e^qianaon (Pkczek, «/ al. , Dev. , 112:205-2 1 8, 
1993). In the zebrafish, wild type cdls in tyclops hosts can contribute to and induce 
acQacent odis to fonn floor plate; but only v/hen the transplanted cells populate the neural 
plate and not the notochord (Hatta, et a!.. Nature, 252:339-341, 1991). We have 
demonstrated that, in cyclops mutants, nudline expresaon of twhh is lost while asfeA 
expression is maintained in the notochord (FIGURE 18; Krauss. etal.. supra for sMi), 
taken together, these results suggest that the homogenetic floor plate signal lost in the 
cyclops mutant nwy be encoded by the twhh gene. In the diick and rat, the floor plate 
retains auto-inductive potential long after the loss of floor plate inducing properties by 
the notodwid, desinte ccHidnued expresaon oiMt/vhhl in the notochord (Roelink, et aL , 
supra, Pbczek, et al., supra, Yamada, et al.. Cell, 22:673-686, 1993). Although no 
h(»nologues rfthe n^/tft class have been rqioited in other vatd)rates. agression of other 
hh homologues in patterns more like those of twhh might help explain these 
discrepandes. 

EXAMPLE 16 

TWO DISTINCT SIGNALING PROTEINS PERIVEFTIOM 
THE <M>AA.ENCnnFD PRECURSOR 

EndogOKNJS hh pratdn in Drosophila is fund predominantly as an amino- and a carboxy- 
tennmal firagment (N and C, respectively) derived by an internal auto-proteolytic 
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cleavage of a larger precursor (U for uncleaved), which also occurs in vivo but at lower 
levels (Lee, et aL, supra). Determinants within the amino-terminal domain appear not 
to be required for auto-proteolytic activity, whereas mutations affecting the carboxy- 
terminal domain can block auto-proteolysis and reduce activity in vivo (Lee, ei aL, 
supra). The auto-proteolysis is blocked by a substitution of alanine for the histidine 
normally present at position 329. This histidine is absolutely invariant in alignments of 
all known hh genes, and its scqumce context suggests a catalytic role in auto-proteolysis 
(Lee, et aL, supra). 

FIGURE 17 shows zebrafish twiggy-winkle hedgehog derivatives. 17(a) Cartoons of 
various twhh open reading frames. SS (shaded) is the predicted N-tenmnal signal 
sequMce for secretion of these proteins and encon^asses the first 27 ammo acids of each 
open reading fi-ame. The arrow indicates the predicted internal ate of auto-proteolytic 
cleavage. Amino acid residue numbers are according to Figure 13b, The filled triangle 
denotes the normal termination codon for the twhh open reading fi^e. Construct Uha 
contains a mutation that blocks auto-proteoly^s (the histidine at residue 273 is changed 
to an alanine; see Lee, J.I, et al., supra ). Construct U356|,a contains a stop codon in 
place of amino add residue 357 as well as the H273A mutation in Uha Constmct N 
encodes just the first 200 amino acids of twhh. Construct C has had the codons for 
readues 3 1 - 1 97 deleted. 1 7(b) shows in vitro translation of the e>q)re^on constructs 
shown schematically in part a. Constructs were translated in vitro in the presence of 
methionine and analyzed by autoradiography after SDS-PA(^. The protein products are 
shown schemadcaDy to the left. Lanes land 6; Auto-proteolysis of the fiill-length(Uss) 
protein creates two fi-agments, an N-terminal fiagment (Nsg) and a C-terminal Augment 
(C). Lane 2: Construct Uha only makes an uncleaved iorni oi twhh protein that 
comigrates with Uss twhh via auto-cleavage. Lane 5: Construct C encodes processed and 
unprocessed forms which are visible as two bands migrating closely together. The 
bottom band is the C protem made finom auto-proteolysis of the U^s (a3 1-197). All 
constructs wwe made by in vitro mutagenesis of expression construct TTTSfwA/r (see 
FIGURE 15) uang the method of RPCR, The sequence of all constructs were confirmed 
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by <Iideo?Qr sequencing. In \ntro translations were performed according to manufacturer's 
instmctions (Promega). 

The vertebrate hh proteins encoded by shh, twhh and also undergo 

auto-proteoiysis to yield two smaller species from a single larger precursor ^e, et ai, 

5 supra\ Chang, et al„ supra\ see lanes I and 6 in FIGURE 17b). The invariant histidine 
to alanine mutation to generate a constmct encoding a form of the twhh protein that is not 
auto-proteolytically cleaved (Uha) ^ve also introduced a nonsense codon and, 
deleted a segment of coding sequence to generate constructs that produce either the 
amino- or the caxfooxy-^terminal domains of twhh (N and C, respectively; see lanes 4 and 

10 5 in FIGURE 17b}; constructs are schraiatically diagrammed in FIGURE 1 7a). To target 
these proteins to the secretory pathway, all constructs retained the normal twhh signal 
sequence. 

Synthetic mRNAs transcribed from these constructs were injeaed to examine the role of 
processing and to assay the activities of individual protein fragments; the results are 

15 nmimarized in Table I and are based on the activities presented in FIGURE 1 5. The most 
striking obndusion from these ^eriments is that N and C both exhibit activity, and that 
these activities are distinguishable. Thus, although both N and C are capable of 
ectobpicaliy activating /mdt-^ in the developing eye, tho-eby providing an internal 
irgection control, only N was capable of eflSdently repressing pax-C (FIGURE 1 6). Later 

20 effects on lens development were also more extreme for N, consistent with the role of 
/lox-tf in lens development suggested by its mutant phenotypes in mice. (See Ton, C.C., 
etaL, Ceff 62:1059-1074, 1991;Glaser,T., etaL.Nat Genetics Z232'239, 1992; HUl, 
R.E.. et aL, Nature 354:522-525, 1991; Hogan, B.L., et al, X EmbryoL Esip. Morph,, 
22:95-110, 1986; and Hogan, B.L., <?/a/„ Z)eve/cpwen/, lO^Suppl : 115-1 19, 1988.) 

25 In considering the activity of delta N-C, it is important to recognize the activity of 
endogenous hh genes in these ^mments, vMch are inhibited by delta N-C and 
fragments thereof (see Example 1 8 and FIGURE 1 8 for fiirther discussion) 
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The uncleaved Uha protein is only somewhat less active than C in inducing but it 
also was not able to repress pcDc6 efficiently (FIGURE 16). The latter is partiailarly 
notable since the Uha protein (U356^ ; see FIGURE 17a, b) has activities not 
agnificantly different from N (FIGURE 16). Thus, in addition to canying detemiinants 
5 important f<H- auto-proteolyss and /MK-2induction, the C-toininus also contains a domain 
inhibitory to N-terminal fiinction when in the context of the uncleaved hh protein. The 
C-teramis can also inhibit N action by an intermolecular mechanism (Lai, et al, supra). 
The existence of sudi an inhibitoiy domain in C suggests that if autoprpteolyis c£ui be 
modulated, such modulation might regulate the activity of Mt in vivo, TMs possibility 
10 higMi^ts the importance of ascertaining the processed state of Wr protems expressed in 
any particular patterning center to understand the potential hh activities generated. 
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APB> PBAm ff ATTERIHINC 

In understanding the normal roles of N and C in eye and brain patterning, the N and C 
5 derivatives of the Drasophila hh gene may offer some in^ght. The Drosophila N 
derivative is retamed dose to its embryonic site of synthesis in a segmenlally striped 
pattern (Tabata and Romberg, Cell, 26:89-102, 1994; Taylor, et al., Mech. Dev., 42 89- 
96, 1993), is cell-associated when e7q>ressed in cultured cells, and is effectively bound 
by heparin agarose in vitro, suggesting the possibility of extracellular matrix association. 
10 The C-terminal fragment, in contrast, is not bound effectively by heparin agarose, is 
almost quantitatively released into the culture supernatant of expres^ng cultured cells, 
and is only diffusely localized in anbryos. Although the activities of individual 
fragments have not been assayed, the biochemical differences and tissue distributions of 
Drosophila N and C may account for the short and long range nature of the fiinctions 
15 associated with /lA during />%i5<9>/ri/a development. 

Although the tissue distributions of zebrafish N and C are not known, their activities in 
ectopic e}q)ression assays are also suggestive of short- and long-range fimctioiis when 
con^dered in the context of nonnal repression patterns of pcDc-2 and pax^ The 
normal gradient of pax-2 oq>resdon in the optic vesicle extends a substantial distance 

20 from its maximum adjacent to the site of hh e3q>ression in the protuberance; the ability 
of ectopic C to activate pax-2 thmfore suggests that, consistent with the distribution of 
C in Drosophila, zd)rafish C may cany out a long-range fimction. Repression of 
endogenous pax-^ repression, in contrast, appears to be a short-range fimction since pax- 
6 expression occurs close to endogenous hh expression. EflScimt repression ofpar-6 is 

25 an attribute of constructs producing N, and a short-range function for N would be 
consistmt with the distribution of N m Drosophila, 
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Two types of /eA-dependent activity have been reported for Mi-transfected cultured cells. 
One is the ^parmt contact-dependent induction of floor plate markers (Roelink, H., et 
a/.. Cell 76:761-775, 1994); the second induction of sclerotome maiicers in presomitic 
mesoderm, is difiiisible and acts at long-range. 



1. Materials and Methods 

cDNAs encoding fiiU-length XCTOpus hedgehogs^ or encoding amino terminal or carboxy 
terminal domains linked to secretory leader sequences were transcribed in vitro to yield 
translatable messec^ RNA. The synthetic messenger RNAs, and control mRNAs, were 
microinjected into the animal poles of cleavage stage Xenopus embryos, which were 
allowed to develop to the blastula stage, at vAnoh time the animal cap explants were 
prepared fiom the upper one fourth of the embryo. These blastula tap explants were then 
cultured in vitro in physiological saline in the presence or absence of the transforming 
growth factor beta ftmily member, recombinant human activin A. All explants were 
allowed to devdop imtil control embryos had grown to noirula stage, or to tadpole stage. 
Importantly, blastula caps left untreated diflferentiate from ectoderm into atypidal 
epidermis. Blastula caps treated with activin differentiate into mesodermal and neural 
cell types. Thus, the question was wfaetha- hedgdiog, or its proteolytic derivatives, would 
change the difiE^entiadon of cells away from becoming epidermis, and into aiiother cell 
type. A second question was whether hedgehog can work with activin to alter the normal 
response of the tissue to either ftctor by itself 

Explants were then extracted to yield mRNA by methods commonly used by those of 
skill in the art, whidi was used as template with reverse transcriptase to yield cDNA .^ The * 
cDNA was then used as tenq)late with various sets of primers for PGR for specific genes, 
reverse-transcriptase-polymerase chain reaction, or RT-PCR. This results in specific 
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amplification of radioactive products whidi are diagnostic for the presence and level of 
the messenger RNAs which were present in the explants. Samples were separated on 
polyacrylamide gels, which were ejcposed to X-ray film to yield the bands shoivn in the 
figures. Thus, the darker bands correspond to a greater level of the specific mRNA. 

FIGURE 18A and B demonstrate that hedgehog induces pituitaiy and antoior brain 
genes, and can cooperate with activin or with neural inducers such as noggin and 
foUistatin which are induced by activin to elevate expression of these genes in explarited 
embryonic tissue. All odd numba-ed lanes lack reverse transcriptaise in the RT-PCR 
reaction and are negative controls. All even numbered lanes have this enzjone, and thus 
give specific bands to mRNA. In Panel A, Lanes 1-2 are control blastula caps, lanes 3-4 
are Xmopus hedgehog-expressing blastula caps, lanes 5-6 are control blastula caps 
treated \wth activin, lanes 7-8 are hedgdiog-expressing blastula caps treated with activin, 
and 9-10 are proIactin-e>q)ressing blastula caps treated with activin to serve as a control 
for simply expresdrig a secreted protein in the blastula cap. The primers used for the 
ass^ are shown to Ae left of each pand, XAG 1 is a cement gland marker, XANFIB 
is a pituitary marker, otx-A is an anterior brain marker, cn-2 is a midbrain-hindbrain 
boundary marker, krox 20 is a rhombom^e-specific hindbrain marker, HIHbox 6 is a 
posterior hindbrain marker, NCAM is a general neural maricer, activin is a control for 
mesoderm induction, and dongation fiictor is a poative conti*ol to shown that all even 
numbered lanes did in fiict have cDNA present. 

The panel labelled XANFIB detects a pituitary gene. Lane 4 (panel A) shows that 
hedgehog induces this pituitary marker, and thus likely pituitary cell types, in blastula cap 
explants (see also FIGURE 20, lane 6, for a strongo- signal showing tfiis), when 
con^uired to control explants in the absence of hedgehog Qmc 2), which do npt express : 
this goie. Lane 6 shows that explants treated with acdvin, in the absence of hedgehog, 
also express the pituitaiy gene. Lane 8 shows that explants treated with both hedgehog, 
and with acdvin, give highest levels of the pituitary gene. Lane 10 proves that this efifect 
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of hedgehog is specific, since prolactin, another secreted protein, does not lead to this 
elevated level of pituitaiy gene. 

The panel labelled OTX-A detects this anterior brain gene. Lane 4 (and 6 in Figure 20) 
shows that hedgehog can induce this neural-specific gene. Lane 8 shows that the level 
of this neural gene is highest in tissue treated with both activin and hedgehog, relative to 
hedgehog alone (lane 4), or activin along (lane 6), and control explants do not express this 
gene (lane 2). Agam, this eflfect is spedfic to hedgehog, since prolactin (lane 10) did not 
lead to elevated expression of this gene. The panel labelled XAG-1 detects a cement 
gland-spedfic gene, and lane 4 shows that hedgehog induces this gene at high level. 

In panel 18B, embryos were injected with N or AN-C, and some ammal cap explants were 
treated with activin before culturiqg until abling embryos reached tailbud stage. Lanes 
1, 2: control arumal caps fiorn uninjected embryos. Lanes 3, 4: control animal caps fi'om 
uninjected embryos, treated with activin. Lanes 5, 6: animal caps from embryos injected 
with N and treated with activin. Lanes 7, 8: animal caps fix>m embryos injected with AN- 
C and treated with activin. Whereas N displays activities in activin-treated explants 
amilar to those of X-bhh (see B) AN-C produces the opposite eflfect, decreasing anterior 
and increasing posterior neural marker expression. As shown in Figure 18B, N behaves 
like X'bhh in that it induces elevated levels of XANF-2 and Otx-A (lane 6) relative to 
control activin-treated animal caps (lane 4). Moreover, N also leads to a decrease in the 
expression of more posterior markers, such as krox-20 and X/Htor-6, as observed 
followmg irijection ofX-bhh, In contrast to the activity of N (Fig. 4C, lane 6), AN-C 
decreases the expression of the anterior neural genes XANF-2 or Otx-A (Fig. 4C, lane 
8) in activin-treated animal c^)s when compared to uninjected controls (lane 4). 
Moreover, AN-C also leads to an increase in the exprei||^iori of more posterior markers, 
such as £>f-2 and X//rftar-6. 

FIGURE 19 shows modifies the anteroposterior pattmi of neural gme expr^sion 
in explants under the influence of endogenous non^al inducers. (A) Isolation of dorsal 
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explants from injected embiyos for the preparation of Keller sandwiches (Keller and 
Danilchik, 1988; Dpniach, et al., 1992; redrawn from Doniach, 1993). (B) Keller 
sandwiches were made from uninjected Oanes 1 and 2) and X-AM-injected (lanes 3 and 
4) embro^ total RNA was isolated when control embiyos reached stage 20, and RT-PCR 
5 was used to analyze the expression of XAG-1 and neural markers. XAG-1 is a cement 
^and maite, XANF-2 is an anterior pituitary marker, Otx-A is a forebrain marker, £>i-2 
demarcates the midbrain-hindbrain boundary/ Krox-20 marks rhombomeres 3 and S of 
the hindbrain and XlHbax-S is a spinal cord marker. N-CAM is a general neural marker 
whose expression is not restricted along the anteroposterior axis. The EF-la control 
10 demonstrates that a comparable amount of RNA was assayed in each set. Note that 
expression of XAG-1 and antmor neural markers is stimulated by X-bhh treatment, 
whereas expression of posterior nmiral maricers is suppressed. 

FIGURE 20 demonstration of dififcrwitial activities of N and C domains of hedgehog 
proteins. As in FIGURE 18 above, odd numbered lanes are negative control lanes, and 

15 positive numbered lanes diow specific gene expression for the markers described above. 
The N domain of hedgehog is encoded in the construct called Xhhl208 (lane 8), and the 
C domain is encoded in the constmct called Xhhldeha 27-208 (lane 10). The construct 
Xhhll-1270A (lane 12) is specifically mutated so that it is unable to undergo self- 
processing. The ability of the N and C domains to induce the genes described above is 

20 compared to control blastula c^ expkmts (lane 4), entire onbryos as a positive control 
(lane 2), blastula cap explants expressmg a mutated hedgehog as a negative control (lane 
14), blasteila caps expressing the entire hedgehog 1 (lane 6), and blastula cap explants 
treated with an ind^endent neural inducer, noggin (lane 16) (discovered by Richard 
Harhmd at University of Calift>rnia at Berkel^). 

25 Examining the first panel fisr the cemmt gland marker XAG-1 clearly shows that intact 
hedgdiog (lane 6) and the N domain 0ane 8) and the processing defective hedgdiog (lane 
12) are much better than indudng the cement gland than is the C domain (lane I). ^ 
Examining the second panel demonstrates that the C domain (lane JO) is better at 
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imludng the pituitary gene XANFIB than is the N domun (lane 8). Since the N domain 
induces the XAG^l marker better, described in point A above, the two results together 
clearly demonstrate that the N and C domains have distinguishable activities. 
Examination of the remaining panels shows that all described activities of the normal 
hedgehog (lane 6 ) can be defined in terms of the activities of the N and C domain. 

Exarnining the third panel, for the fbrebrain gene otx-A, shows that both the N domain 
(lane 8) and C domain (lane 10) induce amilar levels of this gene, but the processing 
defective hedgehog (lane 12) is better than either at inducing this gene. 

Examining the fourth panel of this figure (NCAM), (as wdl as the FIGURE 18 panels 
EN-2, krox20, XIHbox6, and NCAM), shows that hedgehog does not mduces these more 
posterior neural genes. Notably, noggin (lane 16) is able to induce pituitary gene and 
forebrain gene, but it also induces the general neural gene, NCAM, which hedgehog does 
not. This clearly shows that hedgehog is a distinct activity fi^om the neural inducer 
noggin, and has a more restricted ability to induce neural goies. 

Experirnents in tfie Xmopus embryo were conducted by irijecting fiill-length hedgehog 
RNA, and immunopredpitatir^ with a C-domain specific antibody, which proves that iiiU 
length hedgehog does in fiict get processed in vivo in vertebrates, consistent with the data 
shown in earlier Exanq>les in Drosophila, Thus, the ideas for the utility of detecting 
hedgehog N and C domains is based on knowledge that such domains do i^ppear through 
hedgehog processnig in vertebrates. Moreover, the knowledge that hedgehog processing 
does occur in vivo naturally raised the question of whether the resulting N and C domains 
have independent activity. 

The results in FIGURE 18 are novel msofir as they establish that the activity of hedgehog 
in inducing a pituitary gene, and an anterior brain gone, may be enhanced by the TGFp 
family of growth fiurtors. This enhancement likely ^plies to the N and C domaihs 
desaibed in FIGURE 20, since the genes analyzed are the same. This enhancement is due 
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to hh synergizing with neural inducing ftctors which are themselves induced by TGF-p 
family members, including but not limited to such molecules as noggin and follistatin. 

The data in FIGURE 20 makes several impoTUoA pomts. First, the date show that the N 
and C domains have different though somei^t ova-lapping activities, and that the N and 
5 C acthdties added togettier account for all of the observed activity of the intact hedgehog 
protein. Thus, any clinical or diagnostic uses of hedgehog might be improved by use of 
the N or C domain, as one generally wishes to use the smallest protein which has ap ; 
activity for clinical work, as it is less likely to evoke adverse immune responses, or other 
adv&se side eflbcts. Second, the date show that the C domain is better than the Ndoniain 

10 in inducing pituitary gene CTCpression and, since it has less induction of cement gland 
genes that intect hedgehog, or N domain, it suggests that the C domain might be useful 
in clinical situations where one wishes to enhance the development or expr^on of the 
pituitary as specifically as possuble. As the pituitary is the source of a number of 
hormones, any treatment for enhancing pituitery cdl growth and activity would ideally 

15 have as few side effects as possible, and the C domain is thus a viable candidate for 
therapies with enhanced pituitary cell growth and function in nund. Third, relating to 
studies regarding noggin, FIGURE 20 shows dearly that while both hedgehog and noggiri 
can induce pituitary gene expression, hedgehog is more specific, since, hedgehog do^ not 
induce the general neural marker NCAM, \;t^ereas no^in induces NCAM as well as 

20 pituitary. Fourth, the hedgehog which was mutated to prevent processirig (lane 12) is as 
active as fuU-length and wild-type hedgehog (lane 6) in indudrig pituitary gene 
expression, but the processing defective hedgehog is better at iriducing the forebrain 
matker otx-A. Thus, for some clinical applications of hedgehog in inducing specific cell 
types, it is possible that the procesdng-d^ective hedgehog will be superior compared to 



25 normal 




FIGURE 21 shows AN-C interfoes with X-bhh and N activity in animal cap explants. 
Embryos were injected with various RNAs, animal cap explants w&re dxltured uridl 
sibling embryos reached tailbud (stege 25), at which time RT-PCR was used to analyze. 
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the expression of the cement gland marker XAG-1 and the control RNA, EF-la. Lanes 
1, 2: control animal cqis from uninjected embryos. Lanes 3, 4: aiumal caps from embryos 
injected with both X^bhh and prolactin RNAs Lanes 6: animal caps from embryos 
injected with box X-bhh and AN-C. Lanes 7, 8: animal caps from embryos injected with 
5 both N and prolactmg RNAs. Lanes 9, 10: animal caps from embryos injected with both 
N and AN-C. The N and X-bhh experiments were conducted independently and thus 
td^sohite levels in lanes 3-6 should not be compared to those in lanes 7-10. Note that the 
induction of XAG-1 expression by X-bhh or N is reduced by co-injection of AN-G. 

An internal deletion of X-bfA (aN-C) blocked the activity of X^bhh and N in e?q;>lants and 
10 reduced dorsoanterior structures in embryos. As elevated hh activity increases the 
e>q>res»on of anterior neural genes, and as aN-C reduces dorsoanterior structures, these 
complementary data support a role for hh in neural induction and anteroposterior 
patterning. 

aN-C deletes amino adds 28-194 of X-bhh. The primary translation product is predicted 
15 to undergo signal sequence cleavage removing amino acids 1-23, and to undergo 
autoproteolysis. Based on the cleavage site in Drosophila hh (Porter, ei aL, Nature, 
124:363, 1995) autoproteolysis would gen^ate a C domain of X-bhh amino acids 198- 
409, as well as a predicted seven annno acid polypeptide, representtr^ amino acids 24-27, 
and 195-197 (Lai, et al, Development ]21:2^^% 1995). Analysis of the effect of aN-C 
20 on noiral markers was by standard methods including Northern blot analysis and in situ 
hybridization (Lai, et al., supra^ incorporated herein by reference). 

Although aN-C does not induce the cement gland marker XAG-1, it decreases the 
expression of anterior ectodermal and neural markers in activin-treated anirnal caps 
Thus, aN-C has the capacity to afifect neural patterning. aN-C also promotes an increase 
25 in posterior neural markers in activin-treated animal caps. Mixing aN-C with N or full 
ler^th X^hh at a 1 : 1 ratio led to a dramatic inhibition of the induction of cement gland 
in animal cap assays, supporting the hypothesis that aN-C interfered with X-hh. 
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The above disclosure generally describes the present invention. A more complete 
undo^tanding can be obtained by reference to the following specific examples ^ch are 
provided herein for purposes of illustration only and are not intended to limit the scope 
of the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: The Johns Hopkins University School of Medicine, et al 
(ii) TITLE OF INVENTION: NOVEL HEDGEHOG-DERIVED POLYPEPTIDES 
5 (iii) NUMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 
10 (C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 
15 (A) MEDItJM TYPE: Floppy disk 

(B) COMPUTER: 1^4 PC connpatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: PCT/US95 

(B) FILING DATE: Ol-DEC-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Haile, Lisa A. 
25 (B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 07265/043W01 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 
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(2) lUFORM/LTION FOR SEQ ID N0:1: 

U) SEQUBHCE CHARACTERISTICS: 

(A) I«BNOTKt 144 base pairs 

(B) TYPE: nucleic acid 
b (C) STRANDEDNBSS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDKA 

(ix), FEATURE: 

(A) HAHE/KBY: CDS 
10 (B) LOCATION: 1..142 

(xi) SEQUENCE DESCRIPTION: SEQ ID M0:1: 

GTG AAA CTG CGG GTG ACC GAG CCC TGG GAC OAA GAT GOC CAC CAC TCA 
48 

Val Lys Leu Arg Val Thr Glu Pro Tip Asp Glu Asp Gly His His Ser 
15 1 5 10 15 

CAG GAG TCT CTG CAC TAC GAG GGC C6C GCA GTG GAC ATC ACC ACG TCT 
96 

Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser 
20 25 30 

20 GAC CGC GAC CGC AGC AAG TAC GGC ATG CTG GCC CGC CTG GCG GTG O 

142 

Asp Arg Asp Arg Ser Lys Tyr Gly Met Leu Ala Arg Leu Ala Val 
35 40 45 

AG 

25 144 

(2) INFORMATION FOR SEQ ID NO:2: 

*• * ■■ ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 
30 {C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
35 (B) LOCATION: 1..142 

Ixi) SEQUENCnS DESCRIPTION: SEQ ID Nd:2: 

GTG AAG CTG CGG GTG ACC GAG GGC TGG GAC GAG GAC GOC CAC CAC TCA 

Val Lys Leu Arg Val Thr Glu Gly Trp Asp Glu Asp Gly His His Ser 
40 50 55 60 

GAG GAG TCC CTG CAT TAT GAG GGC CGC GCG GTG GAC ATC ACC ACA TCA 
96 

Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp He Thr, Thr Ser 
65 70 75 
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GAC CGC GAC CGC AAT AAG TAT GGA CTG CTG GCG CGC TTG GCA GTQ G 
142 

Asp Arg Asp Arg Asn Lys Tyr Gly heu Leu Ala Arg Leu Ala Val 
80 85 90 

5 AG 

144 ' 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQtJENCE DESCRIPTION: SEQ ID N0:3: 

15 He Ser Ser His Val His Gly Cys Phe Thr Pro Glu Ser Thr 

1 5 10 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

25 ,ser He Ser His Met His Gly Cys Phe Thr Pro Glu Ser Thr 

1 5 10 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

35. Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr 

1 5 10 

(2) INFORMATION FOR SEQ ID N0:6: . 
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10 



15 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONBSS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQX}ENC£ DESCRIPTIC»Y: SEQ ID NO: 6: 

Val Ala Ala Lys Ser Asp Gly Cys Phe Pro Gly Ser Ala Thr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLBCUXiE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr 
1 5 10 

(2) INFORMATION FOR SEQ ID N0:9: • * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 
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(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 9: 

Val Ala Ala I-ye Ser Gly Gly Cys Phe Pro Ala Gly Ala Arg Thr 
15 10 15 

(2) INFORMATION FOR SBQ ID N0:10: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) liENOTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOUXSY: both 

10 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

20 (ii) MOLECULE TYPE: protein 



(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 11: 

Val Ala Ala Lys Thr Gly Gly Cys Phe Pro Gly Glu Ala Leu 
1 5 10 

(2) INFORMATION FOR SBQ ID NO: 12: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

30 (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Gly Val Arg Ser Gly Gly Cys Phe Pro Gly Thr Ala Met 
1 5 10 

(2) INFORMATION- FOR SEQ ID NO: 13: 
35 (i) SEQUENCE CHARACTERISTICS: 
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10 



15 



20 



(A) LENGTH: X4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDKESS : not relevant 

(D) TOPOLOGY: both 

{ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTIC»7 : SBQ ID MO: 13: 

Leu Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) 5TRANDEDNBSS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

His Gly His Gly Cys Phe Thr Pro 
1 S 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Gly His Gly Cys Phe Thr Pro 

X 5 

(2) INFORMATION FOR SBQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
(D> TOPOLOGY: both 

(ii) I^LBCULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Lys Ser Gly Gly Cye Phe Pro Gly 
1 5 



(2) INFORM/ITION FOR SBQ ID NO: 17: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino aclds 

(B) TVPB: amino acid 
(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 17: 

Met Asp Val Arg Leu His Leu Lys Gin Phe Tila lieu Leu Cys Phe lie 
1 5 10 15 

Ser Leu Leu Leu Thr Pro Cys Gly Leu Ala Cys Gly Pro Gly Arg Gly 
20 25 30 

15 Tyr Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Lys 
50 55 60 

Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu He 
20 65 70 75 80 

Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Asn 
85 90 95 

Ala Asp Arg Leu Met Thr Lys Arg Cys Lys Asp Lys Leu Asn Ser Leu 
100 105 110 

25 Ala He Ser Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

Glu Gly Trp Asp Glu Asp Gly His His Leu Glu Glu Ser Leu His Tyr 
130 135 140 

Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys 
30 145 150 155 160 

Tyr Gly Met Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 

Tyr Tyr Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

35 Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr VStl " 

195 200 205 

Thr Leu Gly Asp Gly Thr Arg Lys Pro He Lys Aep Leu Lys Val Gly 
210 215 220 

Asp Arg Val Leu Ala Ala Asp Glu Lys Gly Asn Val Leu He Ser Asp 
40 225 230 235 240 
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Phe lie Met Phe He Asp His ABp Pro Thr Thr Arg Arg Gin Phe He 
245 250 255 

Val He Glu Thr Ser Glu Pro Phe Thr Lys Leu Thr Leu Thr Ala Ala 
260 265 270 

5 His Leu Val Phe Val Gly Asn Ser Ser Ala Ala Ser Gly He Thr Ala 
275 2B0 285 

Thr Phe Ala Ser Asn Val Lys Pro Gly Asp Thr Val Leu Val Trp Glu 
290 295 300 

ASP Thr Cys Glu Ser Leu Lys Ser Val Thr Val Lys Arg He Tyr Thr 
10 305 3X0 315 320 

Glu Glu His Glu Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He 
325 330 335 

He val Asp Glh Val Leu Ala Ser Cys Tyr Ala Val He Glu Asn His 
340 345 350 

15 Lys Trp Ala His Trp Ala Phe Ala Pro Val Arg Leu Cys His Lys Leu 
355 360 365 

Met Thr Trp Leu Phe Pro Ala Arg Glu Ser Asn Val Asn Phe Gin Glu 
370 375 380 

ASP Gly He His Trp Tyr Ser Asn Met Leu Phe His He Gly Ser Trp 
20 385 390 395 400 

Leu Leu Asp Aarg Asp Ser Phe His Pro Leu Gly He Leu His Leu Ser 

405 410 415 

(2) INFORMATION FOR SBQ ID NO:18: 

(i) SEOXIENCE CHARACTERISTICS: 
25 (A*) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGfYi linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

30 Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
1 5 10 IS 

Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 35 40 45 

Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 * 80 

40 Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 

85 90 95 



1 . V) 

vnummt tcmmnsm 



10 



15 
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Leu Met Thr Gin Arg Cye Lys Asp Lye Leu Asn Ser Leu Ala He Ser 
100 105 110 

Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

ASP Glu Asp Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
145 ISO 155 160 

Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 - 175 

Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 200 205 , 

ASP Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
^210 215 220 

Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe He Met 
225 230 235 240 

Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val He Glu 
245 250 255 

Thr Gin Glu Pro Val Glu Lys He Thr Leu Thr Ala Ala His Leu Leu 
260 265 270 

Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 285 

25 Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp As^ 
290 295 300 

Ser Gly Gin Leu Lys Ser Val He Val Gin Arg lie Tyr Thr Glu Glu, 
305 310 315 - 32a, 

Gin Arg Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He Val Val 
3d 325 330 335 

Asp Arg He Leu Ala Ser Cys Tyr Ala Val He Glu Asp Gin Gly Leu 
340 345 350 

Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 . 365 

35 Phe Leu Phe Pro Gin Asn Ser Ser Ser Arg Ser Asn Ala Thr Leu Gin 
370 375 380 

Gin Glu Gly Val His Trp Tyr Ser Arg Leu Leu Tyr Gin Met Gly Thr 
385 390 395 400 



20 



40 



Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 
40S 410 415 



Ser Ser 
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(2) INFORMATION FOR 8EQ ID N0:X9j 

(i) SEQUENCe CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: both 

(ii) MOLECOLB TYRE: protein 



(xi) SEQUENCE DESCRIPTION: BEQ ID NO: 19: 

Met Val Glu Met Leu Ijeu Leu Thr Arg lie Leu Leu Val Qly Pbe He 
10 1 5 10 15 

Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

lie Gly His Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lye 
35 40 45 

15 Qln Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 

50 55 60 

Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu He 
65 70 75 80 

Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly 
20 85 90 95 

Ala Asp Arg Leu Met Thr Cys Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 110 

Ala He Ser Val Met Asn Cys Trp Pro Gly Val Met Leu Arg Val Thr 
115 120 125 

25 Glu Gly Trp Asp Glu Asp Gly His His Ser Lys Glu Ser Leu His Tyr 

130 135 140 

Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 160 

Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
30 165 170 175 

Tyr Tyr Glu Ser Lys Ala His He Cys Ser Val Lys Ala Glu Ash Ser 
180 185 190 

Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His 
195 200 205 

35 Leu Glu His Gly Gly Thr Lys Leu val Lys Asp Leu Ser Hie Gly Asp 

210 215 220 

Arg Val Leu Ala Ala Asp Ala^Asp Gly Arg Leu Leu Val Ser Asp Phe 
225 230 ^ 235 240 

Leu Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 
40 245 250 .255 
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Val He Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

Pro Val Val Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
305 310 315 320 

Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 
325 330 335 

Thr Thr Ala Cye Gly Thr He Leu He Asn Arg Val Leu Ala Ser Cys 
340 345 350 

Tyr Ala Val He Glu Glu His Ser Trp Ala His Ala Ala Phe Ala Pro 
355 360 365 

His Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 
370 375 380 

He Pro Thr Ala Ala Thr Thr Thr ^ir Gly He His Trp Tyr Ser Arg 
385 390 395 400 

Leu Leu Tyr Arg He Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 
405 410 415 

Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 

(2) INFORMATION FOR SBQ ID NO: 20: 

(i) SBQXJKNCE CHARACTERISTICS: 
25 (A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECOLE TYP^: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

30 Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val He Leu Ala Ser Ser 
1 5 10 15 

Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

Lys Arg Arg His Pro Lys Lye Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 35 40 45 

He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
- 50 55 60 

GlV Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn . . ^ 
€5 70 75 . _ M 

40 Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Tlir Gly Ala Asp 

85 90 95 



20 



) 

W09€/17924 



) 

PCT/[rS9S(15463 



.105- 



Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He 
XOO 105 110 

Ser Val Met Asn Gin Trp Pro Oly Val Lys Leu Arg Val Thr Glu Gly 
115 120 125 

5 Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

Met Leu Ala Arg Leu Ala Val Glu Ala Oly Phe Asp Trp Val Tyr Tyr 
10 165 170 175 

Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 

Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 • 

15 Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val He 
20 245 250 255 

Glu Thr Leu Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 265 270 

Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

25 Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 

Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His S^f 
305 310 315 

: Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 
30 325 330 335 

His Gly Thr He Leu He Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 

He Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 365 

35 Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 

Gly Gly Gly Ser He Pro Ala Ala Gin Ser Ala Thr Glu Ala Xrg Gly 
385 390 395 400 
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Ala Glu Pro Thr Ala Gly He His Trp Tyr Ser Gin Leu Leu Tyr His 
405 410 415 

He Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 

Ala Val Lys Ser Ser 
435 
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CLAIMS 

1. A substamiaUy pure polypeptide characterized by havmg an anrinp w^^ 
derived from amino terminal amino acids of a hedgehog protein and having at its 
carboxy terminus, a Gl CF deavage ate spedficaily recognized by a proteolytic 
activity of Uie caiboxy terminal fragment of the native hedgehog polypeptide. 

2. An isolated polynucleotide sequence encodmg a polypeptide having an amino 
acid sequence of claim 1. 

3. The isolated polynucleotide sequence of claim 2, encoding a polypeptide having 
an amino acid sequence of tiie polypeptide of claim 1 and having at least one 
epitope for an antibody immunoreactive with the polypeptide of daim 1 . 

4. A recombinant expression vector which contains tiie polynucleotide of daun 2. 

5. A host cdl which contains tiie expresaon vector of daim 4. 

6. An antibody which binds to tije polypeptide of claim 1 and which binds tvith 
ii^norcsactive fragments of the polypeptide of claim 1. 

7. The antibody ofdaim 6, wherdn the antibo(ty is polyclonal. 

8. The antibody ofdaim 6, whwdntfie antibody is monodonal. 



wo 96/17924 PdyUS9S/lS463 



•108- 



9. A substantially pure polypeptide characterized by having an amino acid sequence 
derived from carboxy terminal amino acids of a hedgehog protein and having at 
Its amino terminus, a Gi CF cleavage site specifically recognized by a proteolytic 
activity of the carbo?qr terminal fragment of the native hedgehog polypeptide. 



10. An isolated polynucleodde sequence encoding a polypeptide having an ammo 
acid sequence of claim 9. 

1 1 . The isolated polynucleotide sequence of claim 10, encoding a polypeptide having 
an amino acid sequence of the polypeptide of claim 9 and having at least one 
epitope for an antibody immimoreactive with the polypeptide of claim 9. 

12. A recombinant expression vector vAiich contains the polynucleotide of claim 10. 

13. A host cell which contains the expres^on vector of claim 12. 

14. An antibody which binds to the polypeptide of claim 9 and v^ch binds with 
inmiunoreactive fragments of the polypeptide of claim 9. 

15. The antibody of daim 14, wherein the ahtibody is polyclonal. 

16. The antibody of claim 14, wherein the antibody is monoclonal. 

17. A method for modulating proliferation or difierentiation of neuronal cells, 
con:q)rising contacting the cells with a hedgehog polypeptide. 

1 8. The method of claim 1 7, wherein the hedgehog polypeptide is the polypeptide of 
claim 1. 
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19. The method of claim 17, viAerm the hedgehog polypeptide is the polypeptide of 
claim 9. 

20. The method of claim 17, wherein the modulation is induction of proliferation or 
differentiation. 

21 . Hie method of daim 17. wherein the neuronal cells are substantiaUy derived from 
floor plate neuronal cells. 

22. The method of dmm 21 , wherein the neuronal cells are motor neurons. 

23 . The method of claim 17, wherein the neuronal cells are vertebrate neuronal cells. 

24. The method of claim 23, wherein the vertd)rate is a human. 

25. The method of claim 17. further comprising contacting the cell with a member of 
the TGF-P fiimily. 

26. The method of claim 25, y/hemn the TGF-P is actiwn. 

27. The method of daim 17, further comprising contacting the ceU with a neural 
inducing agent. 

28. The method of claim 27, wherein the neural inducing agent is sdected from the 
group consisting of noggin, follistatin, and a neurotrophin. 

29. An autoproteolytic fusion protdn comprising a first polypeptide including the 
proteolytic domain of the polypeptide of claim 9, a deavage site recognized by 
the first polypeptide, and a second p lypqitide. 
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30. A polynucleotide encoding the fosion protein of claim 29 . 

31. A method for producing an autoproteolytic fosion protein comprising operably 
linking a first polynucleotide encoding a first polypeptide including the 
proteolytic domain of the polypeptide of claim 9 and the deavage site recognized 
by the proteolytic domain, and a second polynucleotide encoding a second 
polypeptide. 
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FIGURE IE 
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FIGURE 11B 
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FIGURE 13 
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FIGURE 16 



Table 1. Eff«a» or ectoptc expression of MIpMk t^^hMH^ (iWih*C/ff^and /sc^on zebiafish embfyonicdevdopmeriL 



6ccopicpax-2meve 
22h 

EciDpicpttr«2 in eve 
Rectuced /mx-6 in eye 
Reduoe<t in vemni fbr«brain 
Ctedueed ptftr-^ (A hindtnin 

UH 

tensibsem 
lens smaller 
Reduced eve pifment 
No midbratn-hfn^sfitn osnsiriaion 

The peroenoges or affected embryos are shown ; ihe mimbers of embryos aiuyed are given in parentfieses. Embryos were analyst 
ac the indicaced dme-poims after injeedon of tyntbedc mRNA. as described In die text and Figs 3«5. These (esults represent* set of 
asseys pcnbrmed togedver for d&ece oompartson of acdvicies; f«peddons yields similar results. See dexc and F^. 7 for description of 
tniUCTption constructs* 
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FIGURE 18 
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FIGURE 19 
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FIGURE 21 
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